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New US patent application 

Axiogenesis AG ' • 

Our Ref.: AX02A03/P-US EV32T47fiTb5US 
Secreted Proteins As Markers For Cell Differentiation 

5 Field of the invention 

The present invention relates to the use of proteins capable of being secreted across the cell 
membrane as cell-type specific reporters during cell development. In detail, the present 
invention relates to embryonic stem (ES) cells stably transfected with a DNA construct 
comprising a DNA sequence encoding a secreted protein, which is operably linked to a cell- 

10 and/or development-dependent promoter; a method for preparing such ES cells; a cell culture 
obtainable by culturing said ES cells; a method for the toxicological examination of 
substances using such cell cultures as well as methods for the identification of differentiation 
and growth factors for induction of selectively differentiated cells which can be used as a 
source for tissue grafts; a method for producing transgenic non-human mammals using said 

15 ES cells; a transgenic non-human mammal obtainable by said method; and a method for 
examining stages of cellular development using cells of such a non-human mammal. The 
present invention also concerns assays to identify culture conditions and substances that 
promote the differentiation into a desired cell type, or to determine the toxicity of a compound 
for certain cell types. Those assays are particularly suited to give indications of possible 

20 developmental and differentiation disturbances during embryogenesis. 

Background art 

Precursor cells have become a central interest in medical research. On the one hand precursors 
can replace cells that are senescent or damaged by injury or disease and on the other hand 

25 these cells represent an ideal model for studying development and differentiation and the 
factors influencing these processes. Employing conventional cell lines for these studies has 
the disadvantage that individual cell lines may not be folly representative of the complex 
biology of an intact organism. Moreover, even repeating the tests in multiple cell lines does 
not reproduce or account for the complex interactions among cells and tissues that occurs in 

30 an organism. 

This is the main reason why toxicity testing in cell lines can not replace conventional animal 
tests. New chemical compounds are constantly being developed and tested on animals. In 
addition to industrial and household chemicals, a number of chemical compositions are 
developed each year for use as pharmaceuticals. Rules regarding the testing of potential 
35 pharmaceuticals are promulgated by the Food and Drug Administration ("FDA"), which 
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currently requires comprehensive testing of toxicity, mutagenicity, and other effects in at least 
two species before a drug candidate can be entered into human clinical trials. Preclinical 
toxicity testing alone costs some hundreds of thousands of dollars. Despite this huge 
investment, almost one third of all prospective human therapeutics fail in the first phase of 

5 human clinical trials because of unexpected toxicity. It is clear that currently available 
toxicological screening assays do not detect all toxicities associated with human therapy or 
exposure to chemicals in the environment. Better means of screening potential therapeutics or 
chemicals in general for potential toxicity would reduce the cost and uncertainty of 
developing new therapeutics and materials, for example for use in medical devices or in other 

1 0 devices or goods humans are exposed to every day. 

The detection of teratogenic and/or embryotoxic properties of chemical agents occurs 
presently by determination of the reproduction toxicity of test substances following single or 
multi-administrations to pregnant laboratory mammals and by tests of the embryotoxicity in 

15 the early stages of pregnancy. Furthermore, in vitro tests are performed with mammal 
embryos (Neubert and Merker, de Gruyter, Berlin-New York (1981)) and with embryonic 
organs for teratogenicity tests. These test procedures have however the disadvantage that they 
require the use of a large number of live mammals, in particular rats and mice. In vitro test 
procedures, in which primary cell cultures of limb buds (for example, "Limb Buds", Kochhar, 

20 Teratology 11 (1975), 273-287), or brain parts of embryonic rats (Flint and Orton, Toxicol. 
Appl. Pharmacol. 76 (1984), 383-395) or permanent cell lines of embryonic or adult mammal 
tissue, such as tumor cells of the ovary or embryonic palate cells are employed, do not fulfill, 
strictly speaking, the requirements which are imposed on the teratogenicity tests during the 
embryogenesis, namely giving indications of possible disgenesis or developmental 

25 disturbances. 

Efforts have been made for a couple of years to employ permanent cultures of 
totipotent/pluripotent embryonic stem cells (ES cells) for the detection of embryotoxic and 
mutagenic substances; see, e.g. Laschinski et al., Reproductive Toxicol. 5 (1991), 57-65; 
30 Newall and Beedles, Toxicol, in Vitro 8 (1994), 697-701; Sehlmeyer and Wobus, Mutation 
Res. 324 (1994), 69-76. 

All those systems make use of the biological properties of the test ES cells or of a reporter 
system that has been introduced into the cells. 



For example, US-A-6,498,018 describes a method for determining the effect of a biological 
agent by contacting a cell culture with a biological agent. The cell culture contains human 
multipotent CNS neural stem cells that are derived from primary CNS neural tissue and a 
culture medium with preselected growth factors. The read-out is provided by the effect of a 
5 biological agent on the presence or absence on a biological function or property ascribable to 
the cell culture. A major disadvantage of this system is the fact that particular biological 
functions or properties inherent to a certain culture of cells are difficult to measure and often 
involve the destruction of a large part of the culture in order to obtain enough material for the 
assay. 

10 WO02/086073 discloses a method for the positive selection of neuronal cells differentiated 
from nuclear transfer embryonic stem cells by taking advantage of the neural stem cell marker 
nectin. This method is limited to neural cell types expressing nectin naturally. 
US-A-6,007,993 describes an in vitro test procedure for the detection of chemically-induced 
embryotoxic (for example also teratogenic) effects based on differentiated pluripotent 

15 embryonic stem (ES) cells from the mouse and rat and using embryonic germ (EG) cells 
obtained established from primordial germ cells. Stable transgenic ES or EG stem cell clones 
are constructed, wherein a bacterial reporter gene, LacZ or the luciferase gene, is brought 
under the control of tissue-specific promoters or developmental control genes. Following 
differentiation of the ES cells in the presence of teratogenic substances into the different 

20 germination path derivatives, there occurs a differentiation-dependent expression in the cells, 
due to the activity of the tissue-specific promoters. The activation, repression or modulation 
of these tissue-specific genes is detected based on a reaction depending on the reporter gene 
employed, for example the X-Gal assay. 

WO99/01552 discloses embryonic stem (ES) cells, which are transfected in a stable manner 
25 with a DNA construct encoding a non-cell damaging fluorescent protein and operatively 
linked thereto a cell- or development-dependent promoter. Also disclosed is a method for 
toxicological monitoring of substances using these ES cell cultures. 

Although the above described methods employ semi-quantitative as well as relatively simple 
30 and robust assays, those assays usually involve the destruction of the tested cells. The 
preparation of a cell lysate required by many of the described conventional assay systems 
results in a high intra- as well as inter-assay variability. For repeated measurements during the 
course of differentiation many parallel cultures have to be assayed with the associated 
problems of variability. Laborious preparation of lysates also renders the assay less amenable 
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to high-throughput formats. Using fluorescent reporter proteins can overcome some of those 
problems. However, GFP expression has already shown some toxic effects (Liu et al. 
Biochem. Biophys. Res. Comm. 260 (1999), 712-717). Another disadvantage is the auto- 
fluorescence of aggregates of ES cells (embryoid bodies) which gives a high background in 
5 fluorescence measurements. 

The technical problem of readily monitoring cells differentiating into a certain cell type and 
thereby providing a reliable and highly sensitive test system for toxicity screening, 
differentiation promoting agents and/or appropriate culture conditions, medium supplements, 
10 is achieved by providing the embodiments characterized in the claims, and described further 
below. 

Summary of the invention 

The present invention relates to a method of monitoring cell differentiation comprising 
15 culturing cells capable of differentiating into at least one particular cell type, wherein said 
cells contain at least one recombinant nucleic acid molecule comprising a reporter gene 
encoding a product that is secreted upon cell differentiation, under conditions allowing 
differentiation of the cells, or maintaining a non-human animal comprising such cells under 
appropriate conditions; and determining the amount or activity or amount of the reporter gene 
20 product either within a body fluid of said transgenic non-human animal or the cell culture 
medium or sample thereof. 

The present invention also concerns the mentioned reporter gene constructs for monitoring 
cell differentiation, as well as cells, aggregates of at least one cell type, tissues comprising 
25 those cells or aggregates, organs comprising said cells or tissue, implants and transplants 
comprising said organ, all of which are characterized by comprising such a reporter gene 
construct. 

Furthermore, the present invention relates to a non-human animal and a composition of matter 
30 both characterized by comprising a reporter gene construct, a cell, a cell aggregate, a tissue 
and/or an organ described herein. 



In addition, the instant invention relates to an array or chip which contains the cells, cell 
aggregates or tissues of the invention attached to a solid support or suspended thereon as well 
as to an apparatus for analyzing said array or chip. 

5 In particular, the present invention relates to test systems to identify substances that influence 
the differentiation of cells into certain cell types. Therefore, the present invention provides a 
method for obtaining and/or profiling a modulator of cell differentiation. This method 
comprises contacting a test sample containing a cell, cell aggregate, tissue, organ or a non- 
human transgenic animal as defined above with the substance to be tested; and then 
10 determining the effect of the test substance on the amount of the reporter gene product or 
activity compared to a control sample or animal. 

The test system provided by the present invention is useful for drug screening purposes. It is 
envisaged to obtain and manufacture drugs which promote or inhibit the formation of said 
1 5 specific cell types using the method described above, wherein an enhanced or reduced level or 
activity of the reporter gene product is indicative for the drug. An enhanced or reduced level 
or activity of the reporter gene product, respectively, can also be indicative for an agent or 
compound which supports wound healing and/or healing of damaged tissue or toxicity of 
compounds, respectively. 

20 

The present invention additionally relates to a kit useful for conducting the methods described 
above, which contains, for example, a reporter gene construct and/or a cell as defined above, 
and standard compounds, like cell culture media, selection agents, detection agents for the 
reporter molecule and control samples. 

25 

Furthermore, methods of conducting a drug discovery and/or a target discovery business are 
disclosed. Both methods involve providing one or more assay systems described above for the 
identification of modulators of cell differentiation; and/or conducting therapeutic profiling of 
the modulators identified, or further analogs thereof, for efficacy and toxicity in the transgenic 
30 animals provided by the present invention. The modulators identified using the methods 
provided by the invention are then either formulated in a pharmaceutical composition or 
licensed to a third party for further drug development and/or sales. 



The present invention also relates to modulators of cell differentiation which have been 
identified using the methods of the invention as well as to pharmaceutical compositions 
comprising such modulators for use in the modulation of cell differentiation and the treatment 
of corresponding diseases. 

5 

Furthermore, the present invention relates to the use of a reporter gene construct, a cell, a cell 
aggregate, tissue, organ, implant or transplant, a non-human animal, a composition, an array 
or an apparatus provided herein in drug discovery, pharmacokinetic or pharmacological 
profiling. 

10 

Brief description of the figure 

Figure 1: SEAP expression driven by the cardiac-specific aMHC promoter by cells 
differentiated from ES cells in the presence of retinoic acid. 
ES cells transfected with the a-MHC-SEAP vector (containing the SEAP gene 

15 under the control of the heart-specific aMHC promoter) were differentiated in 

the presence of all-trans retinoic acid at concentrations indicated. After 13 days 
culture supernatants were assayed for SEAP activity. Shown are mean values of 
duplicate samples from which background SEAP activity was substracted. 
Activities are expressed as percentage of SEAP activity in samples containing 

20 only solvent (DMSO) but no retinoic acid. 



Definitions 

For the purposes of this description, the term "stem cell" can refer to either stem cell or germ 
cell, for example embryonic stem (ES) and germ (EG), respectively. Minimally, a stem cell 

25 has the ability to proliferate and form cells of more than one different phenotype, and is also 
capable of self renewal-either as part of the same culture, or when cultured under different 
conditions. Embryonic stem cells are also typically telomerase positive and OCT-4 positive. 
Telomerase activity can be determined using TRAP activity assay (Kim et al., Science 266 
(1997), 2011), using a commercially available kit (TRAPeze(R) XK Telomerase Detection 

30 Kit, Cat. s7707; Intergen Co., Purchase N.Y.; or TeloTAGGG(TM) Telomerase PCR 
ELISAplus, Cat. 2,013,89; Roche Diagnostics, Indianapolis). hTERT expression can also be 
evaluated at the mRNA level by RT-PCR. The LightCycler TeloTAGGG(TM) hTERT 
quantification kit (Cat. 3,012,344; Roche Diagnostics) is available commercially for research 
purposes. 
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In accordance with the present invention, the term embryonic stem (ES) cell includes any 
multi- or pluripotent stem cell derived from pre-embryonic, embryonic, or fetal tissue at any 
time after fertilization, and having the characteristic of being capable under appropriate 
5 conditions of producing progeny of several different cell types that are derivatives of all of the 
three germinal layers (endoderm, mesoderm, and ectoderm), according to a standard art- 
accepted test, such as the ability to form a teratoma in 8-12 week old SCID mice. 
"Embryonic germ cells" or "EG cells" are cells derived from primordial germ cells. The term 
"embryonic germ cell" is used to describe cells of the present invention that exhibit an 

10 embryonic pluripotent cell phenotype. The terms "human embryonic germ cell (EG)" or 
"embryonic germ cell" can be used interchangeably herein to describe mammalian, preferably 
human cells, or cell lines thereof, of the present invention that exhibit a pluripotent embryonic 
stem cell phenotype as defined herein. Thus, EG cells are capable of differentiation into cells 
of ectodermal, endodermal, and mesodermal germ layers. EG cells can also be characterized 

15 by the presence or absence of markers associated with specific epitope sites identified by the 
binding of particular antibodies and the absence of certain markers as identified by the lack of 
binding of certain antibodies. 

"Pluripotent" refers to cells that retain the developmental potential to differentiate into a wide 
20 range of cell lineages including the germ line. The terms "embryonic stem cell phenotype" 
and "embryonic stem-like cell" also are used interchangeably herein to describe cells that are 
undifferentiated and thus are pluripotent cells. 

Included in the definition of ES cells are embryonic cells of various types, exemplified by 
25 human embryonic stem cells, described by Thomson et al. (Science 282 (1998), 1145); 
embryonic stem cells from other primates, such as Rhesus stem cells (Thomson et al., Proc. 
Natl. Acad. Sci. USA 92 (1995), 7844), marmoset stem cells (Thomson et al., Biol. Reprod. 
55 (1996), 254) and human embryonic germ (hEG) cells (Shamblott et al., Proc. Natl. Acad. 
Sci. USA 95 (1998), 13726). Other types of pluripotent cells are also included in the term. 
30 Any cells of mammal origin that are capable of producing progeny that are derivatives of all 
three germinal layers are included, regardless of whether they were derived from embryonic 
tissue, fetal tissue, or other sources. The stem cells employed in accordance with the present 
invention that are preferably (but not always necessarily) karyotypicaliy normal. However, it 
is preferred not to use ES cells that are derived from a malignant source. 



"Feeder cells" or "feeders" are terms used to describe cells of one type that are co-cultured 
with cells of another type, to provide an environment in which the cells of the second type can 
grow. The feeder cells are optionally from a different species as the cells they are supporting. 
5 For example, certain types of ES cells can be supported by primary mouse embryonic 
fibroblasts, immortalized mouse embryonic fibroblasts(such as murine STO cells, e.g., Martin 
and Evans, Proc. Natl. Acad. Sci USA 72 (1975), 1441-1445), or human fibroblast-like cells 
differentiated from human ES cells, as described later in this disclosure. The term "STO cell" 
refers to embryonic fibroblast mouse cells such as are commercially available and include 
1 0 those deposited as ATCC CRL 1503. 

The term "embryoid bodies" (EBs) is a term of art synonymous with "aggregate bodies". The 
terms refer to aggregates of differentiated and undifferentiated cells that appear when ES cells 
overgrow in monolayer cultures, or are maintained in suspension cultures. Embryoid bodies 

15 are a mixture of different cell types, typically from several germ layers, distinguishable by 
morphological criteria; see also infra. As used herein, "embryoid body", "EB"or "EB cells" 
typically refers to a morphological structure comprised of a population of cells, the majority 
of which are derived from embryonic stem (ES) cells that have undergone differentiation. 
Under culture conditions suitable for EB formation (e.g., the removal of Leukemia inhibitory 

20 factor or other, similar blocking factors), ES cells proliferate and form small mass of cells that 
begin to differentiate. In the first phase of differentiation, usually corresponding to about days 
1-4 of differentiation for humans, the small mass of cells forms a layer of endodermal cells on 
the outer layer, and is considered a "simple embryoid body". In the second phase, usually 
corresponding to about days 3-20 post-differentiation for humans, "complex embryoid 

25 bodies" are formed, which are characterized by extensive differentiation of ectodermal and 
mesodermal cells and derivative tissues. As used herein, the term "embryoid body" or "EB" 
encompasses both simple and complex embryoid bodies unless otherwise required by context. 
The determination of when embryoid bodies have formed in a culture of ES cells is routinely 
made by persons of skill in the art by, for example, visual inspection of the morphology. 

30 Floating masses of about 20 cells or more are considered to be embryoid bodies; see. e.g., 
Schmitt et al., Genes Dev. 5 (1991), 728-740; Doetschman et al. J. Embryol. Exp. Morph. 87 
(1985), 27-45. It is also understood that the term "embryoid body", "EB", or "EB cells" as 
used herein encompasses a population of cells, the majority of which being pluripotent cells 
capable of developing into different cellular lineages when cultured under appropriate 



conditions. As used herein, the term also refers to equivalent structures derived from 
primordial germ cells, which are primitive cells extracted from embryonic gonadal regions; 
see, e.g., Shamblott, et al. (1998), supra. Primordial germ cells, sometimes also referred to in 
the art as EG cells or embryonic germ cells, when treated with appropriate factors form 
5 pluripotent ES cells from which embryoid bodies can be derived; see, e.g., US Patent US-A- 
5,670,372; Shamblott, et al., supra. 

The terms "polynucleotide" and "nucleic acid molecule" refer to a polymer of nucleotides of 
any length. Included are genes and gene fragments, mRNA, tRNA, rRNA, ribozymes, cDNA, 

10 recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA and 
RNA, nucleic acid probes, and primers. As used in this disclosure, the term polynucleotides 
refer interchangeably to double- and single-stranded molecules. Unless otherwise specified or 
required, any embodiment of the invention that is a polynucleotide encompasses both a 
double-stranded form, and each of the two complementary single-stranded forms known or 

15 predicted to make up the double-stranded form. Included are nucleic acid analogs such as 
phosporamidates and thiophosporamidates. 

A cell is said to be "genetically altered", "transfected", or "genetically transformed" when a 
polynucleotide has been transferred into the cell by any suitable means of artificial 
20 manipulation, or where the cell is a progeny of the originally altered cell that has inherited the 
polynucleotide. The polynucleotide will often comprise a transcribable sequence encoding a 
protein of interest, which enables the cell to express the protein at an elevated level. The 
genetic alteration is said to be "inheritable" if progeny of the altered cell have the same 
alteration. 

25 

A "regulatory sequence" or "control sequence" is a nucleotide sequence involved in an 
interaction of molecules that contributes to the functional regulation of a polynucleotide, such 
as replication, duplication, transcription, splicing,, polyadenylation, translation, or 
degradation of the polynucleotide. Transcriptional control elements include promoters, 
30 enhancers, and repressors. 



Particular gene sequences referred to as promoters, like the "aMHC" or "collagen" promoter, 
are polynucleotide sequences derived from the gene referred to that promote transcription of 
an operatively linked gene expression product. It is recognized that various portions of the 



10 

upstream and intron untranslated gene sequence may in some instances contribute to promoter 
activity, and that all or any subset of these portions may be present in the genetically 
engineered construct referred to. The promoter may be based on the gene sequence of any 
species having the gene, unless explicitly restricted, and may incorporate any additions, 
5 substitutions or deletions desirable, as long as the ability to promote transcription in the target 
tissue. Genetic constructs designed for treatment of humans typically comprise a segment that 
is at least 90% identical to a promoter sequence of a human gene. A particular sequence can 
be tested for activity and specificity, for example, by operatively linking to a reporter gene; 
see Figure 1 . 

10 

According to the present invention, the term "cell- and/or development-dependent promoter" 
is intended to mean a promoter which displays its promoter activity only in particular cell 
types and/or only in particular stages of cellular development, both in cell cultures (embryoid 
bodies) and in transgenic non-human mammals derived from the ES cells according to the 
15 invention. In addition, any other known cell-specific promoter can be employed, e.g. for nerve 
cells, heart cells, neurons, glia cells, hematopoietic cells, endothelial cells, smooth muscle 
cells, skeletal muscle cells, cartilage cells, fibroblasts and epithelial cells. 

Genetic elements are said to be "operatively linked" if they are in a structural relationship 
20 permitting them to operate in a manner according to their expected function. For instance, if a 
promoter helps initiate transcription of the coding sequence, the coding sequence can be 
referred to as operatively linked to (or under control of) the promoter. There may be 
intervening sequence between the promoter and coding region so long as this functional 
relationship is maintained. 

25 

In the context of encoding sequences, promoters, and other genetic elements, the term 
"heterologous" indicates that the element is derived from a genotypically distinct entity from 
that of the rest of the entity to which it is being compared. For example, a promoter or gene 
introduced by genetic engineering techniques into an animal of a different species is said to be 
30 a heterologous polynucleotide. An "endogenous" genetic element is an element that is in the 
same place in the chromosome where it occurs in nature, although other elements may be 
artificially introduced into a neighboring position. 
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The terms "polypeptide", "peptide" and "protein" are used interchangeably in this disclosure 
to refer to polymers of amino acids of any length. The polymer may comprise modified amino 
acids, it may be linear or branched, and it may be interrupted by non-amino acids. 

5 If not stated otherwise the terms "compound", "substance" and "(chemical) composition" are 
used interchangeably herein and include but are not limited to therapeutic agents (or potential 
therapeutic agents), agents of known toxicities such as neurotoxins, hepatic toxins, toxkis of 
hematopoietic cells, myotoxins, carcinogens, teratogens, or toxins to one or more 
reproductive organs. The chemical compositions can further be agricultural chemicals, such as 
10 pesticides, fungicides, nematicides, and fertilizers, cosmetics, including so-called 
"cosmeceuticals", industrial wastes or by-products, or environmental contaminants. They can 
also be animal therapeutics or potential animal therapeutics. 

Industrial products that can be tested with the methods of the present invention include 
bleaches, toilet, blocks, washing-up liquids, soap powders and liquids, fabric conditioners, 

1 5 window, oven, floor, bathroom, kitchen and carpet cleaners, dishwater detergents and rinse 
aids, watersoftening agents, descalers, stain removers, polishes, oil products, paints, paint 
removers, glues, solvents, varnishes, air fresheners, moth balls and insecticides. 
New ingredients for household products are constantly being developed and needed to be 
tested. For example, in recent years new enzymes (to digest stains) and "optical brighteners" 

20 (which make washing appear whiter) have been developed for use in washing powders and 
liquids. New surfactants (which cut through grease to remove ingrained dirt) and chemical 
"builders" (which act as water softeners and enable surfactants to work more effectively) have 
been developed for use in washing powders and liquids, washing-up liquids and various 
cleaning agents. But also medical materials have to be tested, for example dental materials 

25 such as new filling polymers, metal alloys, and bioactive ceramic. Furthermore, chemical 
compositions of any part of a device, such as catheters, electrodes, adhesives, paste, gel or 
cream may be tested with the method of the present invention in different concentrations and 
with different ingredients and impurities present. 

30 As used herein, "molecular profile" or "profile" of a chemical composition or compound 
refers to a pattern of alterations in gene or protein expression, or both, in an ES cell, embryoid 
body, tissue, etc. contacted by the chemical composition compared to a like cell, embryoid 
body or tissue in contact only with culture medium. 
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Detailed description 

Stem cells of various kinds have become an extremely attractive modality in regenerative 
medicine. They can be proliferated in culture, and then differentiated in vitro or in situ into 
the cell types needed. This plasticity makes them ideal models for toxicity testing as well as 
5 investigation of pharmaceuticals such as tissue specific promoting factors. Particularly 
embryoid bodies which consist of different cell types of the three germ layers that interact 
with each other provide a highly sensitive test system. The ability of the stem cells to 
differentiate into the different cell types in the presence of a test substance may depend on the 
ability of the substance to promote differentiation and/or the toxicity of said substance. 
10 This method of monitoring cell differentiation comprises: 

- transfecting cells capable of differentiating into at least one particular cell type with at 
least one recombinant nucleic acid molecule comprising a reporter gene encoding a 
product that is secreted upon cell differentiation; 

- introducing the cells into a non-human animal or culturing the cells under conditions 
15 allowing differentiation of the cells; and 

- determining the amount or activity of the reporter gene product either within a body fluid 
of said transgenic non-human animal or the cell culture medium. 

The present invention is based on the use of a reporter gene that is secreted. Surprisingly it 
was found that secreted reporter molecules have certain advantages over the use of, for 

20 example, a fluorescent reporter like Green Fluorescent Protein (GFP) and its derivatives. Liu 
et al. (Biochem. Biophys. Res. Comm. 260 (1999) 712-717) observed toxic effects when 
using a GFP-reporter. Such toxic effects can be avoided or at least minimized if a secreted 
reporter molecule is used, simply by exchanging the culturing medium on a regular basis and 
thereby preventing toxic concentration of the respective reporter molecule building up. 

25 Also, a preferred method for the differentiation of cells is to grow them as embryoid bodies 
(EBs), which show a high autofluorescence rate even when the cells were lysed. This 
background can render the use of fluorescent reporter genes for the monitoring of cell 
differentiation quite inefficient. 

Other routinely used reporter molecules like for example beta-galactosidase or luciferase 
30 require the cells to be tested in a certain assay buffer which means that the cells have to be 
destroyed to determine the activity. The system according to the invention avoids this by 
employing a secreted reporter molecule so that assaying the culture medium of the 
differentiating cells or the body fluid an animal containing said cells is sufficient to monitor 
the reporter activity and thereby the differentiation. 
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For the transfection of the nucleic acid molecule encoding the reporter protein any standard 
method known to the person skilled in the art can be employed. Any suitable expression 
vector for this purpose can be used. Suitable viral vector systems for producing stem cells 
altered according to this invention can be prepared using commercially available virus 

5 components. The introduction of the vector construct or constructs into the cells occurs in a 
known manner, e.g. by transfection or transduction with the help of viral vectors. Transfection 
of vector plasmids into cells can be accomplished by, e.g., electroporation or lipofection, i.e. 
using lipid/DNA complexes. Exemplary is the formulation Lipofectamine 2000(TM), 
available from Gibco/Life Technologies. Another exemplary reagent is FuGENE(TM) 6 

10 Transfection Reagent, a blend of lipids in non-liposomal form and other compounds in 80% 
ethanol, obtainable from Roche Diagnostics Corporation. . Viral vectors comprising effector 
genes are generally described in the publications referenced in the last section. 

The cells to be transfected have to be able to differentiate into at least one particular cell type. 
This will be the case for the precursor cell for the desired cell type. Precursor cells include but 
are not limited to for example the human neuronal precursor cell line NTERA-2 cl.Dl. 
(Leypoldt et al., J. Neurochem. 76 (2001), 806-814), the human retinal precursor cell line 
KGLDMSM (Ezeonu et al., DNA Cell Biol. 19 (2000), 527-537), a human leukemic cell line, 
PER-117, bearing the markers of a T-cell precursor phenotype (Kees, Blood 72 (1988), 1524- 
1529) or HL60, a promyelocyte leukemia cell line that differentiates to mature neutrophils 
(Fontana, Proc. Natl. Acad. Sci. USA 77 (1980), 3664-3668). 

Since these precursor cell lines have only a limited range of developmental capacity it is 
preferred that multipotent or pluripotent cells are used. This is the case when said cells are or 
are derived from stem cells or primordial germ cells, which are preferred in the embodiments 
of the present invention. 

The invention can be practiced using stem cells of any vertebrate species. Included are stem 
cells from humans; as well as non-human primates, domestic animals, livestock, and other 
non-human mammals. Preferred sources for stem cells are rodents, in particular mice. 
30 Amongst the stem cells suitable for use in this invention are primate pluripotent stem cells 
derived from tissue formed after gestation, such as a blastocyst, or fetal or embryonic tissue 
taken any time during gestation. Non-limiting examples are primary cultures or established 
lines of embryonic stem cells. The invention is also applicable to adult stem cells, for example 
multipotent adult progenitor cells (MAPCs); see, e.g., Reyes and Verfaillie, Ann. N Y Acad. 
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Sci. 938 (2001), 231-235 and Weissman and Anderson, Annu. Rev. Cell. Dev. Biol. 17 
(2001), 387-403. It is also referred to the literature of Anderson et al., Nat. Med. 7 (2001), 
393-395 and Prockop, Science 276 (1997), 71-74, wherein the extraction and culture of adult 
stem cells is described. 

5 

Media for isolating and propagating stem cells can have any of several different formulas, as 
long as the cells obtained have the desired characteristics, and can be propagated further. 
Suitable sources include Iscove's modified Dulbecco's medium (IMDM), Gibco, 
#12440-053; Dulbecco's modified Eagles medium (DMEM), Gibco #11965-092; Knockout 

10 Dulbecco's modified Eagles medium (KO DMEM), Gibco #10829-018; 200 mM L- 
glutamine, Gibco # 15039-027; non-essential amino acid solution, Gibco 11140-050; [beta]- 
mercaptoethanol, Sigma # M7522; human recombinant basic fibroblast growth factor (bFGF), 
Gibco # 13256-029. Exemplary serum-containing ES medium and conditions for culturing 
stem cells are known, and can be optimized appropriately according to the cell type. Media 

1 5 and culture techniques for particular cell types referred to in the previous section are provided 
in the references cited herein. 

Because of their wide range of developmental capacity said stem cells are embryonic stem 
(ES) cells in a particularly preferred embodiment of this invention. 

20 As mentioned before, several sources for ES cells are at the disposal of the skilled person of 
which human stem cells are preferred for most of the embodiments of the present invention. 
Human embryonic stem cells and their use for preparing different cell and tissue types are 
also described in Reprod. Biomed. Online 4 (2002), 58-63. Embryonic stem cells can be 
isolated from blastocysts of members of the primate species (Thomson et al., Proc. Natl. 

25 Acad. Sci. USA 92 (1995), 7844). Human Embryonic Germ (EG) cells can be prepared from 
primordial germ cells present in human fetal material taken about 8-11 weeks after the last 
menstrual period. Suitable preparation methods are described in Shamblott et al., Proc. Natl. 
Acad. Sci. USA 95 (1998), 13726. Methods for making cells that resemble embryonic stem 
cells or embryonic germ cells in morphology and pluripotency derived from primordial germ 

30 cells isolated from human embryonic tissue, such as from the gonadal ridges of human 
embryo, are described in US patent US-A-6,245,566. 

Recently, is has been reported that exfoliated human deciduous tooth, a comparably very 
accessible tissue, contains multipotent stem cells that were identified to be a population of 
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highly proliferative, clonogenic cells capable of differentiating into a variety of cell types 
including neural cells, adipocytes, and odontoblasts; see Miura et al., Proc. Natl. Acad. Sci. 
USA, (2003), Apr 25; p S0027-8424. After in vivo transplantation, those cells were found to 
be able to induce bone formation, generate dentin, and survive in mouse brain along with 
5 expression of neural markers. Furthermore, multilineage potential of homozygous stem cells 
derived from metaphase n oocytes has been described in by Lin et al. in Stem Cells 21 
(2003), 152-161. Various sources of precursor cells in postnatal muscles and the factors that 
may enhance stem cell participation in the formation of new skeletal and cardiac muscle in 
vivo are reviewed in Grounds et al. J. Histochem. Cytochem. 50 (2002), 589-610. Purification 

10 of rare Hematopoietic Stem Cell(s) (HSC) to homogeneity that home to bone marrow is 
described in US2003/0032185. These adult bone marrow cells are described to have 
tremendous differentiative capacity as they can also differentiate into epithelial cells of the 
liver, lung, gastro-intestinal tract, and skin. This finding may contribute to clinical treatment 
of genetic disease or tissue repair. Furthermore, techniques such as nuclear transfer for 

15 embryo reconstruction may be employed wherein diploid donor nuclei are transplanted into 
enucleated Mil oocytes. This technology along with other procedures that aid in the 
establishment of customized embryonic stem (ES) cell lines that are genetically identical to 
those of the recipient have been reviewed by Colman and Kind, Trends BiotechnoL 18 
(2000), 192-196. 

20 The field of stem cell technology is being reviewed by Kiessling and Anderson, Harvard 
Medical School, in Human Embryonic Stem Cells: An Introduction to the Science and 
Therapeutic Potential; (2003) Jones and Bartlett Publishers; ISBN: 076372341X. 

In order to avoid the use of for example human embryos as the donor for stem cells, which 
25 however seems to be justifiable at least under certain circumstances, it may even be possible 
to employ transgenic non-human animals, in particular mammals as source for embryonic 
stem cells. For example, compositions and methods for making transgenic swines to be used 
as xenograft donors is described in US patent US-A-5,523,226. Likewise, WO97/12035 
describes methods of producing transgenic animals for xenotransplantation. Furthermore, 
30 immunologically compatible animal tissue, suitable for xenotransplantation into human 
patients, is described in WO01/88096. Methods for making embryonic germ cells from 
porcine are described for example in US patent US-A-6,545,199. Cells immunologically 
compatible with humans can also be employed for purposes of the present invention. 
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1 i 

Stem cells can be propagated continuously in culture, using a combination of culture 
conditions that promote proliferation without promoting differentiation. Traditionally, stem 
cells are cultured on a layer of feeder cells, typically fibroblast type cells, often derived from 
embryonic or fetal tissue. The cell lines are plated to near confluence, usually irradiated to 
5 prevent proliferation, and then used to support when cultured in medium conditioned by 
certain cells (e.g. Koopman and Cotton, Exp. Cell 154 (1984), 233-242; Smith and Hooper, 
Devel. Biol. 121 (1987), 1-91), or by the exogenous addition of leukemia inhibitory factor 
(LIF). Such cells can be grown relatively indefinitely using the appropriate culture conditions 
without differentiation. 

10 

In the absence of feeder cells, exogenous leukemia inhibitory factor (LIF), or conditioned 
medium, ES or EG cells spontaneously differentiate into a wide variety of cell types, 
including cells found in each of the endoderm, mesoderm, and ectoderm germ layers. With 
the appropriate combinations of growth and differentiation factors, however, cell 

15 differentiation can be controlled. For example, mouse ES and EG cells can generate cells of 
the hematopoietic lineage in vitro (Keller et al., Mol. Cell. Biol. 13 (1993), 473-486; Palacios 
et al., Proc. Natl. Acad. Sci. USA 92 (1995), 7530-7534; Rich, Blood 86 (1995), 463-472). 
Additionally, mouse ES cells have been used to generate in vitro cultures of neurons (Bain et 
al, Developmental Biology 168 (1995), 342-357; Fraichard et al, J. Cell Science 108 (1995), 

20 3161-3188), cardiomyocytes (heart muscle cells) (Klug et al., Am. J. Physiol. 269 (1995), 
H1913-H1921), skeletal muscle cells (Rohwedel et al, Dev. Biol. 164 (1994), 87-101), 
vascular cells (Wang et al, Development 114 (1992), 303-316), US patent US-A-5,773,255 
relates to glucose-responsive insulin secreting pancreatic beta cell lines, US patent US-A- 
5,789,246 relates to hepatocyte precursor cells. Hepatic differentiation of murine embryonic 

25 stem cells is also described in Jones et al., Exp. Cell Res. 272 (2002), 15-22. 

Other progenitors of interest include but are not limited to chondrocytes, osteoblasts, retinal 
pigment epithelial cells, fibroblasts, skin cells such as keratinocytes, dendritic cells, hair 
follicle cells, renal duct epithelial cells, smooth and skeletal muscle cells, testicular 
progenitors, and vascular endothelial cells. 

30 

In certain embodiments of the invention, differentiation is promoted by withdrawing one or 
more medium component(s) that promote(s) growth of undifferentiated cells, or act(s) as an 
inhibitor of differentiation. Examples of such components include certain growth factors, 
mitogens, leukocyte inhibitory factor (LIF), and basic fibroblast growth factor (bFGF). 
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Differentiation may also be promoted by adding a medium component that promotes 
differentiation towards the desired cell lineage, or inhibits the growth of cells with undesired 
characteristics. 

5 In a preferred embodiment of the present invention, said reporter gene in the recombinant 
nucleic acid molecule described above is operably linked to at least one cell type specific 
regulatory sequence. This embodiment has the advantage of providing not only a monitoring 
system for differentiation per se but also allows observing the differentiation into a certain 
cell type. The regulatory sequence will drive the expression of the reporter molecule only in 

10 those cells that acquire the desired cell type by differentiation. Usually, said regulatory 
sequence comprises such promoter and/or enhancer elements. In addition, or alternatively, a 
second reporter gene can be used which is under the control of said cell type specific 
regulatory sequence. The second reporter gene may be the same as used for the differentiation 
dependent expression or different, including conventional reporter genes such as green 

15 fluorescent protein (GFP). Furthermore, the second reporter gene may be present in the 
mentioned recombinant nucleic acid molecule or on a different vector construct, the latter 
option requiring cotransfection of both molecules into the cells. 

In a further embodiment, the differentiated cells containing the recombinant nucleic acid 
20 molecule of the present invention, and optionally further foreign DNA such as vector 
constructs described above, are enriched and separated from non-differentiated cells and/or 
from cells of non-desired cell types. Thus, in accordance with this invention, populations of 
differentiated cells are depleted of relatively undifferentiated cells and/or of cells of undesired 
cell types by using a selection system that is lethal to the undesired cells and cell types. This 
25 can be achieved, for example, by expressing a selectable marker gene under the control of a 
regulatory sequence that causes the gene to be preferentially expressed in the desired cell type 
and/or at a certain stage of development, which renders cells of a specific cell type resistant to 
a lethal effect of an external agent. To accomplish this, the cells are genetically altered with a 
recombinant nucleic acid molecule comprising the selectable marker gene before the process 
30 used to differentiate the cells into the desired lineage. The selection system may be present in 
the recombinant nucleic acid molecule of the present invention or on a different vector 
construct which is used for cotransfection. 
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The cell types, which the differentiation can be directed to, include connecting fibroblasts, 
stromal cells, endothelial cells, glia cells, neural cells, neuronal cells, hematopoietic cells, 
smooth muscle cells, skeletal muscle cells, epithelial cells, and cardiac cells. To monitor 
differentiation into these particular cell types, regulatory sequences have to be operatively 
5 linked to the reporter gene sequence that confer an expression of the reporter which is specific 
for said cell types. 

In a preferred embodiment of the method according to the invention said promoter or 
enhancer is selected from the group consisting of aMHC, MLC2V, VE-cadherin 5 Tie-2, Flk-1, 

10 Fit- 1 , GFAP, alpha- 1- tubulin and collagen 2 promoter or enhancer. Whereas the pro-alphal 
(II) collagen chain (collagen 2) promoter/enhancer (Zhou et ah, J. Cell Sci. 108 (1995), 3677- 
3684) is specific for chondrocytes, the expression in endothelial cells can be driven by the 
VE-cad promoter (Gory et aL, Blood 93 (1999), 184-192), the Flt-1 promoter (Quinn et aL, 
Biochem. Biophys. Res. Commun. 276 (2000), 1089-1099), the Tie-2 (Schlaeger et aL, Proc. 

15 Natl. Acad. Sci. USA 94 (1997), 3058-3063) and the Flk-1 promoter/enhancer (Kappel et aL 
Blood 93 (1999), 4284-4292), respectively. The alpha- 1 -tubulin promoter (Gloster et aL, J. 
Neurosci. 14 (1994), 7319-7330) and the glial fibrillary acidic protein (GFAP) promoter 
(Besnard et aL, J. Biol. Chem. 266 (1991), 18877-18883) are specific for neural cells. The 
smooth muscle heavy chain (SMHC) minimal promoter is specific for smooth muscles 

20 (Kallmeier et aL, J. Biol. Chem. 270(1995), 30949-30957). An HSV-1 vector containing the 
rat tyrosine hydroxylase promoter for use in both long-term and cell type-specific expression 
in the midbrain is described by Song et aL, in J. Neurochem. 68 (1997), 792-803. Cardiac 
specific promoters such as alpha-myosin heavy chain promoter (aMHC) and the myosin light 
chain-2v (MLC2v) promoter, the latter being specific for ventricular heart muscles are 

25 described further below. 

Further examples for tissue specific promoters are those, which are active in hematopoietic 
cells, cartilage cells or epidermal cells as well as insulin secreting P-cells. 
"Tissue specific" is to be subsumed under the term "cell specific". Regulatory sequences of 
the above described cell types and other cell type specific promoters can be obtained from the 

30 literature; see, e.g., "medline" and NCBI. 



Cardiovascular diseases are still among the most frequent causes of death in the Western 
industrial countries. Only through intensive basic research in this field, the pathophysiological 
causes can be found, and new therapeutic approaches can be devised, and toxicological 
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changes described. For studying the pathogenesis of cardiovascular diseases and for testing 
new pharmacological and toxicological substances, models are needed which, on one hand, 
can be transferred to humans and, on the other hand, can replace the tedious and cost- 
intensive animal models. Still in the year 1991, over 2 million animals have been employed 

5 for animal experiments in the federal states of the former Western Germany alone. 

One starting point of pharmacological and toxicological research which has recently grown 
more and more important is heart differentiation. From the stereotypically proceeding heart 
cell development, conclusions can be drawn to pathological and toxicological changes of 
cardiomyocytes. Thus, for example, it is known that the receptor state and the intracellular 

10 signal cascades are disturbed in cardiac hypertrophy (Yamazaki et al., J. Mol. Cell Cardiol 27 
(1995), 133-140) and heart insufficiency (Johnson et al, Biochem. Pharmacol. 45 (1993), 
2365-2372). These pathologically altered cardiomyocytes are in part again similar to heart 
cells of early differentiation stages. Thus, besides toxicological tests assays for the 
identification and isolation of substances that are capable of promoting cardiac remodeling are 

15 needed, which can be used for therapeutic interventions in the treatment of patients with 
cardiac dysfunction characterized by, for example, pathological hypertrophy and fibrosis. 

However, the examinations necessary for elucidating the properties of heart cells in early 
differentiation stages are technically difficult to perform with live animals and, if they can be 

20 performed at all, it is only in very complicated studies: On day 12-13, at the earliest, it is 
possible to prepare cardiomyocytes from a murine embryo, but such cells no longer 
correspond to an early cardiac differentiation stage. A detailed analysis of receptor expression 
during various differentiation stages requires a very high expense of animal material and is 
technically difficult to perform as set forth above. Similarly, the observation of the 

25 development of a relatively undifferentiated heart cell over several days or weeks is not 
possible with an animal model. In order to test the use of novel therapeutical agents, e.g., 
inotropic substances or antiarrhythmics or toxical substances, e.g., heavy metals or retinoids, 
an invasive monitoring of animals, e.g., swine, must be performed for weeks in animal 
experiments. 

30 

Now, it has been the object of the present invention to provide a cell based assay which 
enables a simple characterization of the effects compounds on cardiomyogenesis and allows 
for functional examinations rather than being based on reporter genes such as Lac-Z (Niwa et 
al., Gene 108 (1991), 193-199; Wobus et al., J. Mol. Cell Cardiol. 29 (1997), 1525-1539; 
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Metzger et al., Circ. Res. 78 (1996), 547-552), as with prior methods, the expression of which 
genes can be detected only after cell fixation and by means of a specific substrate. 

Surprisingly, it has been found that ES cells can be stably transfected with a DNA construct in 
5 which a gene encoding a secreted reporter protein is coupled with a cardiomyocyte specific 
cell- and development-dependent promoter; see the examples and the vector sequence 
depicted in SEQ ID NO: 3. This construct is integrated in the native DNA and after the 
specific activation of intracellular signals, the promoter is activated and the reporter protein 
expressed and secreted into the medium. Thus, ES cells which activate a cell-specific 
10 transcription factor at a certain point of differentiation could be recognized by determining the 
level of reporter protein or its activity in the cell culture medium. 

Thus, in a particular preferred embodiment of the invention, said differentiating cell type are 
cardiomyocytes. For this embodiment, said cell type specific regulatory sequence is 

1 5 preferably atrial and/or ventricular specific. Corresponding regulatory sequences, i.e. cardiac 
specific promoters are described in the prior art; see also supra. For example Nkx-2.5 specific 
for very early cardiomyocytes and mesodermal precursor cells, respectively, (Lints et al., 
Development 119 (1993), 419 431); human-cardiac-a-actin specific for heart tissue, 
(Sartorelli et al., Genes Dev. 4 (1990), 181 1-1822), and MLC-2V specific for ventricular heart 

20 muscle cells (O'Brien et al., Proc. Natl. Acad. Sci. USA. 90 (1993), 5157 5161; Lee et al., 
Mol. Cell Biol. 14 (1994), 1220-1229; Franz et al., Circ Res. 73 (1993), 629-638 and WO-A- 
96/16163). The cardiac specific alpha-myosin heavy chain promoter is described in Palermo 
et al, Cell. Mol. Biol. Res. 41 (1995), 501-519 and Gulick et al., J. Biol. Chem. 266 (1991), 
9180-91855. The expression of the atrial specific myosin heavy chain AMHC1 and the 

25 establishment of anteroposterior polarity in the developing chicken heart is described in 
Yutzey et al., Development 120 (1994), 871-883. 

Since the detection is aided by using a reporter molecule for which a robust, sensitive and 
specific assay exists, a particularly preferred embodiment is the method according to the 
30 invention, wherein said reporter gene product is secreted alkaline phosphatase (SEAP), alpha- 
amylase or invertase. 

SEAP is heat-resistant and not inhibited by L-homoarginine. Further advantages of this 
reporter molecule are its low toxicity an the fact that the protein levels that are secreted 
correspond directly to the amount of intracellular mRNA. Thus the SEAP reporter assay 
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provides a direct measurement for the cell type-specific expression and thereby the 
differentiation. It is also extremely sensitive: it is possible to detect 10" 13 g SEAP protein 
(Yang et al. Biotechniques 23 (1997), 1110-1114). This makes it possible to detect 
differentiation into cell types that are rare. 

5 The use of alpha-amylase as reporter molecule is described in for example WO98/49320. This 
reporter molecule is particularly attractive because the enzyme is extremely stable over a wide 
range of conditions. The measurement of alpha-amylase activity is simple, quantitative, 
sensitive, safe and inexpensive. It was also shown to be compatible with a SEAP reporter 
system (Schlatter et al., Gene 282 (2002), 19-31). The use of invertase as a marker protein is 

10 described for example in Mace et al., Virology 188 (1992), 869-874. 

Reporter gene constructs comprising inter alia secreted alkaline phosphatase as the reporter 
molecule and, which therefore can be used as a source for preparing vector constructs 
according to the present invention are described for example in WO00/34435 and Wang et al., 

15 Gene 279 (2001), 99-108, the latter describing murine SEAP. Detection of SEAP level can be 
performed as described in the Examples and in the prior art; see, e.g., Yang et al. (1997) and 
Wang et al. (2001), supra. The use of human secreted alkaline phosphatase in an in vivo 
mouse reporter gene model to monitor ovarian tumor growth and response to therapeutics has 
been described in Nilsson et al., Cancer Chemother. Pharmacol. 49 (2002), 93-100. This 

20 model can be adapted to the embodiments of the present invention. Engineered secreted 
alkaline phosphatase reporter genes being capable of being expressed in a mammal for 
extended periods of time and detection of the alkaline phosphatase activity by measuring the 
protein levels or AP activity of a serum or tissue sample from a mammal containing the cell 
with the transgene, or measuring the media from a cultured cell containing the transgene are 

25 also described in WO02/095068. 

Reporter activity or protein amount, in particular SEAP activity, can be detected in any 
biological fluid such as breast milk, amniotic fluid, seminal plasma, serum and urine. 

Since not all molecules that can be used as a reporter are secreted naturally, the nucleic acid 
30 molecule introduced into the cells can be genetically engineered to encode a protein that is 
excreted. Particularly preferred is the method described above, wherein said reporter gene 
product comprises a secretory leader sequence. Examples for secretory leader sequences are 
but are not limited to the adenovirus tripartite leader sequence (Chu et al., Biotechniques 5 
(1995), 890-896), the immunoglobulin secretion signal (Schlatter et al., Gene 282 (2002), 19- 
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31), the SUC2 signal sequence as well as the human interleukin 2 receptor alpha chain signal 
sequence (Kamiya et al., Tohoku J. Exp. Med. 180 (1996), 297-308). 

In order to select those cells which contain the recombinant nucleic acid molecule according 
5 to the invention, and optionally other vector constructs such as those described hereinbefore, 
said recombinant nucleic acid molecule and vector constructs, respectively, preferably further 
comprise a selectable marker expressed by multi- or pluripotent cells. Selectable markers are 
known to persons skilled in the art and include for example nucleoside and aminoglycoside- 
antibiotic-resistance genes conferring resistance to, e.g. puromycin, neomycin, or 

10 hygromycin. Further examples for resistance genes are dehydrofolate-reductase, which 
confers a resistance against aminopterine and methotrexate, as well as multi drug resistance 
genes, which confer a resistance against a number of antibiotics, e.g. against vinblastin, 
doxorubicin and actinomycin D; see, e.g., Blau et al., Blood 89 (1997), 146-154, which 
describes the use of the multiple drug resistance gene 1 (MDR1) gene as a dominant 

1 5 selectable marker. 

In a particularly preferred embodiment of the present invention, said selectable marker 
confers resistance to puromycin. Puromycin is particularly suited for the fast elimination of 
non-cardiac cells in adherent culture of transgenic EBs. Furthermore, drug selection of cardiac 

20 cells can be implemented entirely in the suspension culture of transgenic EBs. Hence, it could 
also be shown that purified ES derived cardiomyocytes survive much longer in culture than 
untreated counterparts. Moreover, the elimination of undifferentiated ES cells during drug 
selection process has itself been shown to have clear positive effect on viability and longevity 
of such differentiated ES derived cells as cardiomyocytes. In addition, it could be surprisingly 

25 shown that the release from surrounding non-differentiated cells induces proliferation of 
cardiomyocytes. Thus, the drug selection possesses both purifying and multiplying effect; see, 
e.g., WO02/051987. Thus, the selection does not only provide the means for eliminating cells 
which can not be monitored since they do not carry the reporter gene, is also allows the 
elimination of cells that did not differentiate into the desired cell type provided that the 

30 reporter gene and the selectable marker gene are driven by the same, cell type-specific 
regulatory sequence. 

The method according to the invention, wherein said cells form cell aggregates or tissue-like 
aggregates derived from different ES cell types is a preferred embodiment and particularly 
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preferred are said cells forming embryoid bodies (EBs). As mentioned before, embryoid 
bodies represent a complex group of cells differentiating into different tissues. In one 
embodiment, the cells within an embryoid body are substantially synchronized for their 
differentiation. Accordingly, at known intervals, the majority of the synchronized cells 
5 differentiate into the three embryonic germ layers and further differentiate into multiple tissue 
types, such as cartilage, bone, smooth and striated muscle, and neural tissue, including 
embryonic ganglia; see also Snodgrass et aL, "Embryonic Stem Cells: Research and Clinical 
Potentials" in Smith and Sacher, eds. Peripheral Blood Stem Cells American Association of 
Blood Banks, Bethesda MD (1993). Thus, the cells within embryoid bodies provide a much 

10 closer model to the complexity of whole organisms than do traditional single cell or yeast 
assays, while still avoiding the cost and difficulties associated with the use of mice and larger 
mammals. Moreover, the recent availability of human embryoid bodies improves the 
predictive abilities of the invention by providing an even closer vehicle for modeling toxicity 
and identification of drugs useful for the treatment of heart disorders in human organ systems, 

15 and in humans. 

The embryoid body of the invention comprises a cell population, the majority of which being 
pluripotent cells capable of developing into different cellular lineages when cultured under 
appropriate conditions. It is preferred that the embryoid body comprises at least 51% 
20 pluripotent cells derived from totipotent ES cells. More preferably, the embryoid body 
comprises at least 75% pluripotent cells derived from totipotent ES cells. And still more 
preferably, the embryoid body comprises at least 95% pluripotent cells derived from 
totipotent ES cells. 

25 In its simplest form, the method of creating a molecular profile according to the present 
invention involves contacting embryoid bodies with a chemical composition of interest, and 
then determining the alterations in reporter activity or amount of reporter protein, or both, in 
the embryoid body exposed to the chemical composition (the "test embryoid body") compared 
to a embryoid body which was not exposed to the agent (a "control embryoid body"). 

30 Thus, the method of the present invention can be performed such that it allows self-assembly 
of the cells into the aggregates or tissue-like structures. The term "embryoid bodies" (EBs) is 
a term of art synonymous with "aggregate bodies". The terms refer to aggregates of 
differentiated and undifferentiated cells that appear when ES cells overgrow in monolayer 
cultures, or are maintained in suspension cultures. Embryoid bodies are a mixture of different 
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cell types, typically from several germ layers, distinguishable by morphological criteria. 
WO02/051987 describes a protocol to obtain embryoid bodies. The manufacturing takes place 
preferably with the "hanging drop" method or by methylcellulose culture (Wobus et al., 
Differentiation 48 (1991), 172-182). 
5 Alternatively to this, spinner flasks (stirring cultures) can be used as culture method. 
Therefore, the undifferentiated ES-cells are introduced into stirring cultures and are mixed 
permanently according to an established procedure. Therefore, 10 million ES-cells are 
introduced into 150 ml medium with 20 % FCS and are stirred constantly with the rate of 20 
rpm., wherein the direction of the stirring motion is changed regularly. 24 hours after 

10 introduction of the ES-cells an extra 100 ml medium with serum is added and thereupon 100 - 
150 ml of the medium is exchanged every day (Wartenberg et al., FASEB J. 15 (2001), 995- 
1005). Under these culture conditions large amounts of ES-cell-derived cells, i.e. 
cardiomyocytes, endothelial cells, neurons etc. depending on the composition of the medium 
can be obtained. The cells are selected by means of the resistance gene either still within the 

1 5 stirring culture or after plating, respectively. 

In a particular preferred embodiment of the present invention, embryoid bodies are prepared 
according a recent developed "mass culture" system employed in the appended examples and 

described in detail in applicant's co-pending Application No. "Novel method for the 

preparation of Embryoid Bodies (EBs) and uses thereof, Attorneys docket no: KNAUTHE- 

20 08190, Express Mail Number EV329478974US, filed on the same day as the present 
application, i.e. on July 8, 2003, which is herein incorporated by reference. 

The embryoid bodies used to test the chemical composition can be of any vertebrate species. 
The choice of the particular species from which the embryoid body is derived will typically 

25 reflect a balance of several factors. First, depending on the purpose of the study, one or more 
species may be of particular interest. For example, human embryoid bodies will be of 
particular interest for use with compositions being tested as potential human therapeutics but 
also for toxicological tests for substances including industrial chemicals, while equine, feline, 
bovine, porcine, caprine, canine, or sheep embryoid bodies may be of more interest for a 

30 potential veterinary therapeutic. Embryoid bodies of other species commonly used in 
preclinical testing, such as guinea pigs, mice, rat, rabbits, pigs, and dogs, are also preferred. 
Typically, embryoid bodies of these species will be used for "first pass" screening, or where 
detailed information on toxicity in humans is not needed, or where a result in a murine or 
other one of these laboratory species has been correlated to a known toxicity or other effect in 
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humans. Furthermore, with respect to human therapeutics, regulatory agencies generally 
require animal data before human trials can begin; it will generally be desirable to use 
embryoid bodies of species which will be used in the preclinical animal studies. The results of 
toxicity testing in the embryoid bodies can then guide the researcher on the degree and type of 
5 toxicity to anticipate during the animal trials. Certain animal species are known in the art to 
be better models of human toxicity of different types than are others, and species also differ in 
their ability to metabolize drugs; see, e.g., Williams, Environ. Health Perspect. 22 (1978), 
133-138; Duncan, Adv. Sci. 23 (1967), 537-541. Thus, the particular species preferred for use 
in a particular preclinical toxicity study may vary according to the intended use of the drug 
10 candidate. For example, a species which provide a suitable model for a drug intended to affect 
the reproductive system may not be as suitable a model for a drug intended to affect the 
nervous system. Criteria for selecting appropriate species for preclinical testing are well 
known in the art. 

15 Once an embryoid body culture has been initiated, it can be contacted with a chemical 
composition. Conveniently, the chemical composition is in an aqueous solution, preferably in 
a solvent conventionally used in cell culture, for example DMSO, and is introduced to the 
culture medium; see also the examples. The introduction can be by any convenient means, but 
will usually be by means of a pipette, a micropipettor, or a syringe. In some applications, such 

20 as high throughput screening, the chemical compositions will be introduced by automated 
means, such as automated pipetting systems, which may be on robotic arms. Chemical 
compositions can also be introduced into the medium as in powder or solid forms, with or 
without pharmaceutical excipients, binders, and other materials commonly used in 
pharmaceutical compositions, or with other carriers which might be employed in the intended 

25 use. For example, chemical compositions intended for use as agricultural chemicals or as 
petrochemical agents can be introduced into the medium by themselves to test the toxicity of 
those chemicals or agents, or introduced in combination with other materials with which they 
might be used or which might be found in the environment, to determine if the combination of 
the chemicals or agents has a synergistic effect. Typically, the cultures will be shaken at least 

30 briefly after introduction of a chemical composition to ensure the composition is dispersed 
throughout the medium. 

The time as which a chemical composition is added to the culture is within the discretion of 
the practitioner and will vary with the particular study objective. Conveniently, the chemical 
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composition will be added as soon as the embryoid body develops from the stem cells, 
permitting the determination of the alteration in protein or gene expression on the 
development of all the tissues of the embryoid body. It may be of interest, however, to focus 
the study on the effect of the composition on a particular tissue type. As previously noted, 
5 individual tissues, such as muscle, nervous, and hepatic tissue, are known to develop at 
specific times after the embryoid body has formed. Addition of the chemical composition can 
therefore be staged to occur at the time the tissue of interest commences developing, or at a 
chosen time after commencement of that development, in order to observe the effect on 
altering gene or protein expression in the tissue of interest. 

10 

Different amounts of a chemical composition will be used to contact an embryoid body 
depending on the amount of information known about the toxicity of that composition, the 
purposes of the study, the time available, and the resources of the practitioner. A chemical 
composition can be administered at just one concentration, particularly where other studies or 

15 past work or field experience with the compound have indicated that a particular 
concentration is the one which is most commonly found in the body. More commonly, the 
chemical composition will be added in different concentrations to cultures of embryoid bodies 
run in parallel, so that the effects of the concentration differences on gene or protein 
expression and, hence, the differences in toxicity of the composition at different 

20 concentrations, can be assessed. Typically, for example, the chemical composition will be 
added at a normal or medium concentration, and bracketed by twofold or fivefold increases 
and decreases in concentration, depending on the degree of precision desired. 

Where the composition is one of unknown toxicity, a preliminary study is conveniently first 
25 performed to determine the concentration ranges at which the composition will be tested. A 
variety of procedures for determining concentration dosages are known in the art. One 
common procedure, for example, is to determine the dosage at which the agent is directly 
toxic. The practitioner then reduces the dose by one half and performs a dosing study, 
typically by administering the agent of interest at fivefold or twofold dilutions of 
30 concentration to parallel cultures of cells of the type of interest. For environmental 
contaminants, the composition will usually also be tested at the concentration at which it is 
found in the environment. For agricultural chemicals, such as pesticides which leave residues 
on foodstuffs, the agent will usually be tested at the concentration at which the residue is 
found, although it will likely be tested at other concentrations as well. Thus, the dilution of 
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test compounds can be done by making in separated tubes a series of dilution of 50 or 100 
fold concentrated compounds in DMSO. One or two |il of each dilution are distributed in each 
well before cell suspension distribution. Particularly preferred are stock and dilution series as 
described for retinoic acid in example 3. 

5 

The above considerations with respect to contacting the compounds with the EBs, contacting 
time, etc, also apply to the assays of the invention performed on e.g. ES cells, tissue and non- 
human animals, if applicable. 

10 The present invention also relates to reporter gene constructs for monitoring cell 
differentiation comprising the recombinant nucleic acid molecule defined previously as well 
as to cells capable of differentiating into at least one particular cell type, that contain said 
reporter gene construct, or are obtainable by the methods described herein. Likewise, cell 
aggregates, tissues, organs, implants or transplants comprising said reporter gene construct, 

15 obtainable by the methods described herein or constituted from said cells, cell aggregates, 
tissues, organs, implants or transplants are subject of the invention. 

In a still further embodiment, the present invention relates to a transgenic non-human animal 
which can be generated from the mentioned ES cells and ES cell derived cell types and cell 

20 aggregates; see supra. A method for the production of a transgenic non-human animal, which 
is encompassed by the present invention, for example transgenic mouse, comprises 
introduction of a reporter gene as described above into a germ cell, an embryonic cell, stem 
cell or an egg or a cell derived therefrom. The non-human animal can be used in accordance 
with a screening method of the invention described herein. Production of transgenic embryos 

25 and screening of those can be performed, e.g., as described by A. L. Joyner Ed., Gene 
Targeting, A Practical Approach (1993), Oxford University Press. A general method for 
making transgenic non-human animals is described in the art, see for example WO 94/24274. 
For making transgenic non-human organisms (which include homologously targeted non- 
human animals), embryonal stem cells (ES cells) are preferred. Murine ES cells, such as AB- 

30 1 line grown on mitotically inactive SNL76/7 cell feeder layers (McMahon and Bradley, Cell 
62 (1990), 1073-1085) essentially as described (Robertson, in Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach. E. J. Robertson, ed. (Oxford: ERL Press) 
(1987), 71-112) may be used for homologous gene targeting. Other suitable ES lines include, 
but are not limited to, the E14 line (Hooper et al., Nature 326 (1987), 292-295), the D3 line 
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(Doetschman et al., J. Embryol. Exp. Morph. 87 (1985), 27-45), the CCE line (Robertson et 
al, Nature 323 (1986), 445-448), the AK-7 line (Zhuang et al., Cell 77 (1994), 875-884) and 
others; see also supra. The success of generating a mouse line from ES cells bearing a specific 
targeted mutation depends on the pluripotence of the ES cells (i.e., their ability, once injected 
5 into a host developing embryo, such as a blastocyst or morula, to participate in embryogenesis 
and contribute to the germ cells of the resulting animal). The blastocysts containing the 
injected ES cells are allowed to develop in the uteri of pseudopregnant nonhuman females and 
are born as chimeric mice. The resultant transgenic mice are backcrossed and screened for the 
presence of the correctly targeted transgene (s) by PCR or Southern blot analysis on tail 

10 biopsy DNA of offspring so as to identify transgenic mice for the reporter locus/loci. For 
example, transient expression of a recombinase gene directly introduced into fertilized eggs 
by pronuclear injection provides a fast and efficient method for generating mutant mice with 
desired deletions or translocations in target genes. Efficient removal of loxP-flanked DNA 
sequences in a gene-targeted locus by transient expression of Cre recombinase in fertilized 

15 eggs is described by Sunaga et al, Mol. Reprod. Dev. 46 (1997), 109-1 13. 

Methods for producing transgenic flies, such as Drosophila melanogaster are also described in 
the art, see for example US-A-4,670,388, Brand and Perrimon, Development 118 (1993), 401- 
415; and Phelps and Brand, Methods 14 (1998), 367-379. Transgenic worms such as C. 
20 elegans can be generated as described in Mello et al., EMBO J. 10 (1991), 3959-3970, 
Plasterk, Methods Cell. Biol. 48 (1995), 59-80. 

All of the embodiments above can not only be used to monitor cell differentiation and 
optimize culturing conditions using modulators in their pure forms but the effective agent(s) 

25 might also be contained in a composition. Hence, compositions of matter comprising any one 
of those recombinant nucleic acid molecules, cells, cell aggregates, or tissues of the present 
invention as described herein are encompassed in the scope of the present invention. As 
described herein, those compositions and methods of the invention can be used for a variety 
of purposes, for example for analyzing early steps of tissue formation during embryonic 

30 development or the influence of factors and compounds on this process. 



In a particularly preferred embodiment, the present invention relates to arrays and chips 
comprising a solid support and attached thereto or suspended thereon cells, cell aggregates or 
tissue obtained by the method of the present invention or being in the differentiation process. 
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Such arrays generally consist of a substrate of glass, plastic or silicon on which the test cells 
aggregates or tissues are deposited in a particular pattern. Depending on the type of array, 
these might additionally be covered by a conductor, e.g. gold, platinum, indium-tin-oxide, 
iridium, etc., which allows a direct measurement by employing the conductivity of cells. The 
5 use of such planar microelectrode arrays for cultured cells and cell aggregates as biosensors is 
of particular interest. 

Furthermore the invention relates to an apparatus for analyzing said array, which will be able 
to measure the particular read-out of the assay employed, e.g. SEAP reaction product, in a 
10 quantitative manner and assign the results to the position on the array. 

The system for monitoring cell differentiation according to the present invention can also be 
employed to identify and characterize substances that modify the differentiation process. 
Populations of differentiated cells and tissue suitable for human administration have to be 

15 identified after the differentiation process to avoid administering undifferentiated cells with 
high tumorigenic potential or cells differentiated into cells that are not needed for the 
particular therapeutic purpose. The positive identification of cells differentiated into a certain 
cell type allows also the optimization of culturing conditions that is necessary to obtain the 
relatively high numbers of differentiated cells needed for therapeutic purposes. Thus, there is 

20 a need to identify conditions for culturing ES cells that drive their differentiation into the 
desired direction. 

Accordingly, a preferred embodiment of the invention is a method of obtaining and/or 
profiling a modulator of cell differentiation comprising: 

contacting a test sample comprising a cell, cell aggregate, tissue or organ or a non-human 
25 animal as defined above with a test substance; and 

determining the effect of the test substance on the amount of the reporter gene product or 
activity compared to a control sample or animal. 

In particular, the methods of the invention can be used for toxicological, embryotoxic, 
30 mutagenic, and/or teratogenic in vitro tests; see supra. Another important aspect of the present 
invention is therefore a method of determining toxicity, preferably teratogenicity, 
embryotoxicity, chronic or acute toxicity of a compound comprising the steps of the methods 
described herein, wherein a reduced or enhanced level or activity of said reporter gene 
product is indicative for the toxicity of the compound. 



30 

The assays may be simple "yes/no" assays to determine whether there is a change in 
expression of the secreted reporter protein by detecting reporter, e.g., enzyme activity or the 
amount of reporter protein. The assay may be made quantitative by comparing the amount of 
5 activity and/or amount of reporter in a test sample with the levels of activity and/or amount in 
a standard sample. As described in the examples, the test compound or a plurality of test 
compounds are subjected to the test cell, preferably embryoid body in different concentrations 
or dilution series, preferably at doses that correspond to physiological levels of the 
corresponding type of test compounds; see also example 3. It is thus also possible to easy 

10 generate compound profiles in purpose similar to those described in WOOO/34525. For 
example, two or more assays may be used, wherein for each assay the test cells or, e.g., 
embryoid bodies comprise recombinant DNA molecules with different cell- and/or 
development-specific promoters operably linked to the same or different reporters, preferably 
wherein each reporter is a secreted protein, e.g., enzyme, most preferably wherein one or each 

15 reporter is SEAP. Those assays can be performed in parallel or subsequently; or the results of 
one assay may be compared with results of a corresponding assay performed elsewhere. 
Likewise, it is envisaged to provide and use cells which comprise two of the mentioned 
recombinant molecules but with different promoters and reporters as mentioned above. Thus, 
a molecular profile of a test chemical composition can be established by detecting the 

20 alterations in reporter gene activity or amount of reporter protein in, for example, embryoid 
bodies contacted by the test chemical composition as described in previous sections. Once the 
molecular profile of the test composition is determined, it can be compared to that of a 
chemical composition with predetermined toxicities or, preferably, to a library of molecular 
profiles of chemical compositions with predetermined toxicities. The outcome of such 

25 comparison provide information for one to predict the likelihood of whether the test 
composition is toxic, what type of toxicities, and how toxic it would be as compared to the 
other known toxic compositions. 

For the purpose of practicing the invention, the predictions of toxicity of the test composition 
30 based on its molecular profiles in ES cells, tissue, etc, preferably EB cells does not have to be 
100% accurate. To have a major positive impact on the efficiency and costs of drug 
development, one only has to modestly increase the probability that the less toxic and thus 
more successful drug candidates are, for example, on the top half of a prioritized list of new 
drug leads. 
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Since the cells and tissue obtained in accordance with the present invention more closely 
resemble the in vivo situation compared to conventional cell based assays, the results obtained 
5 by the assays of the present invention are expected to correlate to in vivo teratogenicity or 
embryotoxicity of the tested compounds as well. The simple read-out of the reporter molecule 
assay is easier measured and interpreted than determining the influence of a substance by 
monitoring the embryonic and post-natal development of a test animal in all aspects and in 
different stages of development. This requires a number of animals to be sacrificed at 
10 different ages, whereas by employing the reporter molecule secreted in an developmental- 
specific manner, samples of body fluids from the animal can be obtained repeatedly and 
allowing the monitoring over an extended time period up to the life-time of the animal. This 
also reduces the variability of test results. 

15 Several test substances can be combined and either added simultaneously or sequentially to 
gain information about possible enhancing or quenching effects. Thus a further aspect of the 
invention relates to the method described previously, wherein said contacting step further 
includes contacting said test sample or animal with at least one second test substance in the 
presence of said first test substance. Two or more substances tested in combination will 

20 provide information about their interaction in general. 

A preferred embodiment of the methods according to the present invention involves adding a 
compound known to activate or inhibit differentiation process to the culture medium or 
animal, particularly if this test substance is a therapeutic agent or a mixture thereof. This 
25 screening may be done, for example, either because the compound is known to have a effect 
on the differentiation of certain cell types and is tested to determine its potential for guiding 
the differentiation of other cell types, or because a compound designed to have effects 
elsewhere may have unintended side effects. The last aspect applies particularly to therapeutic 
agents. 

30 

Both aspects of drug screening can be adapted to compound library screening methods as well 
as screening industrial chemicals from the shelf. Thus another embodiment of the invention 
encompasses the method described above, wherein preferably in a first screen said test 
substance is comprised in and subjected as a collection of test substances. Such a collection of 
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test substances, usually a combinatorial compound library, can have a diversity of about 10 3 
to about 10 5 and is successively reduced in running the method several times until the 
diversity has been reduced to down to a desired factor, preferably down to one compound or 
particular class of compounds. Optionally the method of the present invention can combined 
5 with other screening methods known in the art. Methods for the generation and use of 
peptidomimetic combinatorial libraries are described in the prior art, for example in Ostresh, 
Methods in Enzymology 267 (1996), 220-234 and Domer, Bioorg. Med. Chem. 4 (1996), 
709-715. Drug discovery by dynamic combinatorial libraries is described, for example, in 
Nat. Rev. Drug Discov. 1 (2002), 26-36 and Drug Discov. Today 7 (2002), 117-125. 
10 Compounds identified using the method of the invention can be further evaluated in single 
experiments employing other methods of the present invention. 

Compounds and candidate agents are obtained from a wide variety of sources including 
libraries of synthetic or natural compounds, or as mentioned industrial chemicals; see supra. 

15 For example, numerous means are available for random and directed synthesis of a wide 
variety of organic compounds and biomolecules, including expression of randomized 
oligonucleotides and oligopeptides. Alternatively, libraries of natural compounds in the form 
of bacterial, fungal, plant and animal extracts are available or readily produced. For example, 
inhibition of tumor-induced angiogenesis and matrix-metalloproteinase expression in 

20 confrontation cultures of embryoid bodies and tumor spheroids by plant ingredients used in 
traditional Chinese medicine has been described by Wartenberg et al. in Lab. Invest. 83 
(2003), 87-98. 

A preferred embodiment of the screening method is performed on an array as provided by the 
25 invention 

The assay methods of the present invention can be in a conventional laboratory format or 
adapted for high throughput screening. The term "high throughput screening" (HTS) refers to 
an assay design that allows easy analysis of multiple samples simultaneously, and capacity for 
30 robotic manipulation. Another desired feature of high throughput assays is an assay design 
that is optimized to reduce reagent usage, or minimize the number of manipulations in order 
to achieve the analysis desired. Examples of assay formats include 96-well, 384-well or more- 
well plates, levitating droplets, and "lab on a chip" microchannel chips used for liquid 
handling experiments. It is well known by those in the art that as miniaturization of plastic 



molds and liquid handling devices are advanced, or as improved assay devices are designed, 
that greater numbers of samples may be performed using the design of the present invention. 
In the method of the invention, said one or more cells or tissues are preferably contained in a 
container, for example in a well of a microtiter plate, which may be a 24, 96, 384 or 1586 well 
5 plate. Alternatively, the cells can be introduced into a microfluidics device, such as those 
provided by Caliper (Newton, MA, USA). In another preferred embodiment, the method of 
the present invention comprises taking 3, 4, 5, 7, 10 or more measurements, optionally at 
different positions within the container. 

10 The methods according to the invention can also being performed using cells that are 
genetically engineered to (over) express or inhibit the expression of a target gene. Examples 
for genetically altered stem cells are described in WO03/0 14326 which provides mammalian 
stem cells expressing the VEGF receptor VEGFR-1 or WO95/06409 disclosing human 
hematopoietic stem cells expressing recombinant DNA constructs encoding a recombinant 

15 molecule containing a signal transducing region and an antigenic specificity region. 
Alternatively to using already altered stem cells, said target gene can be transfected at the 
same time as the construct of the present invention conferring the expression of the secreted 
reporter molecule. 

20 HTS, combinatorial library screening as well as conventional assay formats can also be 
employed to identify novel compounds which have an effect on differentiation. Novel as well 
as compounds previously known to modulate cell differentiation can be used to manufacture a 
drug for this purpose. Thus a preferred embodiment of the present invention is a method of 
obtaining and manufacturing a drug which promotes or inhibits formation of specific cell 

25 types comprising the steps of the methods described above, wherein an enhanced or reduced 
level or activity of the reporter gene product is indicative for the drug. 

Compounds of interest encompass numerous chemical classes, though typically they are 
organic molecules. Candidate agents comprise functional groups necessary for structural 
interaction with proteins, particularly hydrogen bonding, and typically include at least an 
30 amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional 
chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic 
structures and/or aromatic or polyaromatic structures substituted with one or more of the 
above functional groups. Candidate agents are also found among biomolecules including 
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peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs or combinations thereof. 

Additionally, natural or synthetically produced libraries and compounds are readily modified 
through conventional chemical, physical and biochemical means, and may be used to produce 
5 combinatorial libraries. Known pharmacological agents may be subjected to directed or 
random chemical modifications, such as acylation, alkylation, esterification, amidification, 
etc. to produce structural analogs. 

The compounds may also be included in a sample including fluids to which additional 
10 components have been added, for example components that affect the ionic strength, pH, total 
protein concentration, etc. In addition, the samples may be treated to achieve at least partial 
fractionation or concentration. Biological samples may be stored if care is taken to reduce 
degradation of the compound, e.g. under nitrogen, frozen, or a combination thereof. The 
volume of sample used is sufficient to allow for measurable detection, usually from about 0.1 
15 |il to 1 ml of a biological sample is sufficient. 

Differentiation processes do not only play a major role in embryonic development but are also 
important for an adult organism since the continuous replacement of senescent or cells 
damaged by injury or disease is achieved by precursor cells differentiating into the cells 
20 needed. An important aspect of the present invention is thus a method of manufacturing an 
agent which supports wound healing and/or healing of damaged tissue comprising the steps 
described herein, wherein an enhanced level or activity of the reporter gene product is 
indicative for said agent. 

25 Compounds identified, isolated and/or produced by the methods described above can also be 
used as lead compounds in drug discovery and preparation of drugs or prodrugs. This usually 
involves modifying the lead compound or a derivative thereof or an isolated compound as 
described hereinbefore such as modifying said substance to alter, eliminate and/or derivatize a 
portion thereof suspected causing toxicity, increasing bioavailability, solubility and/or half- 

30 life. Accordingly, a preferred embodiment of the present invention encompasses the methods 
described previously and further modifying said substance to alter, eliminate and/or derivatize 
a portion thereof suspected causing toxicity, increasing bioavailability, solubility and/or half- 
life. Methods for clinical compound discovery comprises for example ultrahigh-throughput 
screening (Sundberg, Curr. Opin. Biotechnol. 11 (2000), 47-53) for lead identification, and 
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structure-based drug design (Verlinde and Hoi, Structure 2 (1994), 577-587) and 
combinatorial chemistry (Salemme et aL, Structure 15 (1997), 319-324) for lead 
optimization. The various steps recited above are generally known in the art. For example, 
computer programs for implementing these techniques are available; e.g., Rein, Computer- 
5 Assisted Modeling of Receptor-Ligand Interactions (Alan Liss, New York, 1989). Methods 
for the preparation of chemical derivatives and analogues are well known to those skilled in 
the art and are described in, for example, Beilstein, Handbook of Organic Chemistry, Springer 
edition New York Inc., 175 Fifth Avenue, New York, N.Y. 10010 U.S.A. and Organic 
Synthesis, Wiley, New York, USA. Furthermore, peptidomimetics and/or computer aided 

10 design of appropriate derivatives and analogues can be used, for example, according to the 
methods described above. Methods for the lead generation in drug discovery also include 
using proteins and detection methods such as mass spectrometry (Cheng et al. J. Am. Chem. 
Soc. 1 17 (1995), 8859-8860) and some nuclear magnetic resonance (NMR) methods (Fejzo et 
al., Chem. Biol. 6 (1999), 755-769; Lin et al., J. Org. Chem. 62 (1997), 8930-8931). They 

15 may also include or rely on quantitative structure-action relationship (QSAR) analyses 
(Kubinyi, J. Med. Chem. 41 (1993), 2553-2564, Kubinyi, Pharm. Unserer Zeit 23 (1994), 
281-290) combinatorial biochemistry, classical chemistry and others; see, for example, 
Holzgrabe and Bechtold, Pharm. Acta Helv. 74 (2000), 149-155. 

20 In one embodiment, the method of the invention further comprises mixing the substance 
isolated or modified with a pharmaceutically acceptable carrier. Examples of carriers and 
methods of formulation may be found in Remington's Pharmaceutical Sciences. 

Substances are metabolized after their in vivo administration in order to be eliminated either 
25 by excretion or by metabolism to one or more active or inactive metabolites (Meyer, J. 
Pharmacokinet. Biopharm. 24 (1996), 449-459). Thus, rather than using the actual compound 
or drug identified and obtained in accordance with the methods of the present invention a 
corresponding formulation as a pro-drug can be used which is converted into its active form 
in the patient by his/her metabolism. Precautionary measures that may be taken for the 
30 application of pro-drugs and drugs are described in the literature; see, for review, Ozama, J. 
Toxicol. Sci. 21 (1996), 323-329. 



The present invention also relates to a kit containing specific reagents such as those described 
hereinbefore useful for conducting any one of the above described methods of the present 
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invention, containing for example the reporter gene construct described hereinbefore, multi- 
or pluripotent cells, standard compounds, and optionally culture medium, recombinant nucleic 
such acid molecules, detection means for the reporter molecule, in particular for SEAP such 
as EDTA, L-Homoarginine, NBT/BCIP, fixative solution, incubation buffer, PBS, etc. Such a 
5 kit would typically comprise a compartmentalized carrier suitable to hold in close 
confinement at least one container. The carrier would further comprise reagents useful for 
performing said methods. The carrier may also contain a means for detection such as labeled 
enzyme substrates or the like. 

1 0 Yet another aspect of the present invention relates to a method of conducting a drug discovery 
business, comprising: 

- providing one or more assay systems as described herein for identifying a modulator of 
cell differentiation; and/or 

- conducting therapeutic profiling of modulators identified in the previous step, or further 
15 analogs thereof, for efficacy and toxicity in animals according to the invention; and 

formulating a pharmaceutical preparation including one or more modulators identified in 
the previous step as having an acceptable therapeutic profile. 

Utilizing the methods described above, the identity of a modulator of differentiation or a 
20 derivative thereof are determined. Agents are identified by their ability to alter the level of 
secreted reporter molecule. For suitable lead compounds that are identified, further 
therapeutic profiling of the agent, or analogs thereof, can be carried out for assessing efficacy 
and toxicity in animals. Those compounds having therapeutic profiles after animal testing can 
be formulated into pharmaceutical preparations for use in humans or for veterinary uses. The 
25 subject business method can include an additional step of establishing a distribution system 
for distributing the pharmaceutical preparation for sale, and may optionally include 
establishing a sales group for marketing the pharmaceutical preparation. 

Instead of developing the identified cell differentiation modulating agents in house, further 
30 drug development can also be achieved by a different company. Thus a further aspect of the 
present invention relates to a method of conducting a target discovery business comprising: 

- providing one or more assay systems described herein for identifying modulators of cell 
differentiation; 
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- (optionally) conducting therapeutic profiling of modulators identified in the previous step 
for efficacy and toxicity in animals according to the invention; and 

- licensing, to a third party, the rights for further drug development and/or sales for 
modulators identified in the first step, or analogs thereof. 

5 For suitable lead compounds that have been identified, further profiling of the agent, or 
further analogs thereof, can be carried out for assessing efficacy and toxicity in animals, 
depending on the modalities of the agreement with the respective third party. Further 
development of those compounds for use in humans or for veterinary uses will then be 
conducted by the third party. The subject business method will usually involve either the sale 
10 or licensing of the rights to develop said compound but may also be conducted as a service, 
offered to drug developing companies for a fee. 

The present invention also relates to modulators of cell differentiation identified according to 
the methods described above as well as to pharmaceutical compositions for use in the 
15 modulation of cell differentiation comprising such a modulator. 

The modulator according to the invention can be combined with suitable diluents or carriers, 
preferably those which are pharmaceutically acceptable. Examples of such carriers, diluents 
and methods of formulation may be found in Remington's Pharmaceutical Sciences. To form 

20 a pharmaceutically acceptable composition suitable for effective administration, such 
compositions will contain an effective amount of the modulator. Carriers or diluents are 
usually sterile and non-toxic, and defined as vehicles commonly used to formulate 
pharmaceutical compositions for animal or human administration. The diluent is selected so 
as not to affect the biological activity of the combination. Examples of such diluents are 

25 distilled water, physiological saline, Ringer's solutions, dextrose solution, and Hank's 
solution. In addition, the pharmaceutical composition or formulation may also include other 
carriers, adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like. A 
therapeutically effective dose refers to that amount of modulator which is sufficient to achieve 
the desired effect on differentiation of target cells. 

30 Further examples of suitable pharmaceutical carriers are well known in the art and include 
phosphate buffered saline solutions, emulsions, such as oil/water emulsions, various types of 
wetting agents, sterile solutions etc. Compositions comprising such carriers can be formulated 
by well known conventional methods. Accordingly, the present invention also provides a 
method of making a pharmaceutical composition for use in modulating cell differentiation 
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comprising mixing a modulator of cell differentiation identified according to a method of the 
invention with a suitable diluent or carrier. 

The use of a reporter gene construct, a cell, a cell aggregate, tissue, organ, implant or 
5 transplant, a non-human animal, a composition, an array or the apparatus defined herein in 
drug discovery or pharmacokinetic or pharmacological profiling is also preferred embodiment 
of the present invention. Hence, the means and methods of the present invention described 
herein-before can be used in a variety of applications including but not limited to "loss of 
function" assays with ES cells containing homozygous mutations of specific genes, "gain of 
10 function" assays with ES cells overexpressing exogenous genes, developmental analysis of 
teratogenic/embryotoxic compounds in vitro, pharmacological assays and the establishment of 
model systems for pathological cell functions, and application of differentiation and growth 
factors for induction of selectively differentiated cells which, can be used as a source for 
tissue grafts; see for review, e.g., Guan et al., Altex 16 (1999), 135-141. 

15 

As described in the examples, for the expression of the secreted human placental alkaline 
phosphatase protein (SEAP), a particular promoter has been used, i.e. the promoter region of 
the mouse alpha myosin heavy chain gene; see NCBI GenBank Accession No. #U71441 
(SEQ ID NO: 1). In accordance with the present invention, it could be shown that this 

20 promoter region is particularly suited for cell type specific expression of a reporter gene, 
especially for the purposes of the present invention. In addition, further experiments 
performed in accordance with the present invention surprisingly revealed that the mouse 
ventricular myosin regulatory light chain gene promoter (see NCBI GenBank Accession No. 
#AF302688; SEQ ID NO: 2) is particularly suited for the expression of the reporter gene as 

25 well. Thus, the present invention also relates to a recombinant DNA molecule as described 
hereinbefore in a cell-type specific reporter system, comprising the promoter of the mouse 
alpha myosin heavy chain gene and/or of the mouse ventricular myosin regulatory light chain 
gene, in particular the sequence depicted in NCBI GenBank Accession No. #1171441 (SEQ 
ID NO: 1) and #AF302688 (SEQ ID NO: 2), respectively, or a fragment thereof. Typically, 

30 the recombinant DNA molecule is a vector and comprises other functional DNA sequences 
such as ori, bacterial selectable marker, polyadenylation signal, multiple cloning sites, and the 
like. Hence, such vector may also be used to express specifically for examples growth factors 
and the like or target cDNAs in order to elucidate their function during cell development. 
Thus, the present invention involves the use of the mentioned promoter regions for the 
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expression of a DNA sequence encoding a reporter protein or selectable marker; see supra. In 
a particular, the present invention relates to the expression vector described in Example 1 
comprising the nucleotide sequence depicted in SEQ ID NO. 3 as well as to a corresponding 
vector, wherein the mouse alpha myosin heavy chain gene promoter has been replaced by the 
5 mouse ventricular myosin regulatory light chain gene promoter, and their use in the afore- 
described embodiments. 

These and other embodiments are disclosed and encompassed by the description and 
examples of the present invention. Further literature concerning any one of the materials, 

10 methods, uses and compounds to be employed in accordance with the present invention may 
be retrieved from public libraries and databases, using for example electronic devices. For 
example the public database "Medline" may be utilized, which is hosted by the National 
Center for Biotechnology Information and/or the National Library of Medicine at the National 
Institutes of Health. Further databases and web addresses, such as those of the European 

15 Bioinformatics Institute (EBI), which is part of the European Molecular Biology Laboratory 
(EMBL) are known to the person skilled in the art and can also be obtained using internet 
search engines. An overview of patent information in biotechnology and a survey of relevant 
sources of patent information useful for retrospective searching and for current awareness is 
given in Berks, TTBTECH 12 (1994), 352-364. 

20 

The above disclosure generally describes the present invention. A more complete 
understanding can be obtained by reference to the following specific examples and figure 
which are provided herein for purposes of illustration only and are not intended to limit the 
scope of the invention. The contents of all cited references (including literature references, 
25 issued patents, published patent applications as cited throughout this application and 
manufacturer's specifications, instructions, etc) are hereby expressly incorporated by 
reference; however, there is no admission that any document cited is indeed prior art as to the 
present invention. 

30 The practice of the present invention will employ, unless otherwise indicated, conventional 
techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, 
recombinant DNA, and immunology, which are within the skill of the art. 
For further elaboration of general techniques concerning stem cell technology, the practitioner 
can refer to standard textbooks and reviews, for example Teratocarcinomas and embryonic 
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stem cells: A practical approach (E. J. Robertson, ed., IRL Press Ltd. 1987); Guide to 
Techniques in Mouse Development (P. M. Wasserman et al, eds., Academic Press 1993); 
Embryonic Stem Cell Differentiation in Vitro (Wiles, Meth. Enzymol. 225 (1993), 900,); 
Properties and uses of Embryonic Stem Cells: Prospects for Application to Human Biology 

5 and Gene Therapy (Rathjen et al., Reprod. Fertil. Dev. 10 (1998), 31,). Differentiation of stem 
cells is reviewed in Robertson, Meth. Cell Biol. 75 (1997), 173; and Pedersen, Reprod. FertiL 
Dev. 10 (1998), 31. Besides the sources for stem cells described already above further 
references are provided; see Evans and Kaufman, Nature 292 (1981), 154-156; Handyside et 
al., Roux's Arch. Dev. Biol., 196 (1987), 185-190; Flechon et al, J. Reprod. Fertil. Abstract 

10 Series 6 (1990), 25; Doetschman et al., Dev. Biol. 127 (1988), 224-227; Evans et al., 
Theriogenology 33 (1990), 125-128; Notarianni et al., J. Reprod. Fertil. Suppl., 43 (1991), 
255-260; Giles et al., Biol. Reprod. 44 (Suppl. 1) (1991), 57; Strelchenko et al., 
Theriogenology 35 (1991), 274; Sukoyan et al, Mol. Reprod. Dev. 93 (1992), 418-431; 
Iannaccone et al., Dev. Biol. 163 (1994), 288-292. 

15 

Methods in molecular genetics and genetic engineering are described generally in the current 
editions of Molecular Cloning: A Laboratory Manual, (Sambrook et al., (1989) Molecular 
Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press); DNA 
Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 

20 ed., 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); 
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal 
Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Gene Transfer Vectors for Mammalian Cells 
(Miller & Calos, eds.); Current Protocols in Molecular Biology and Short Protocols in 
Molecular Biology, 3rd Edition (F. M. Ausubel et al., eds.); and Recombinant DNA 

25 Methodology (R. Wu ed., Academic Press). Gene Transfer Vectors For Mammalian Cells (J. 
H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In 
Enzymology, Vols. 154 and 155 (Wu et al. eds.); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, 
Methods In Enzymology (Academic Press, Inc., N.Y.); Immunochemical Methods In Cell 

30 And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook 
Of Experimental Immunology, Volumes HV (D. M. Weir and C. C. Blackwell, eds., 1986). 
Reagents, cloning vectors, and kits for genetic manipulation referred to in this disclosure are 
available from commercial vendors such as BioRad, Stratagene, Invitrogen, and Clontech. 
General techniques in cell culture and media collection are outlined in Large Scale 
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Mammalian Cell Culture (Hu et al., Curr. Opin. BiotechnoL 8 (1997), 148); Serum-free 
Media (Kitano, Biotechnology 17 (1991), 73); Large Scale Mammalian Cell Culture (Curr. 
Opin. Biotechnol. 2 (1991), 375); and Suspension Culture of Mammalian Cells (Birch et al., 
Bioprocess Technol. 19 (1990), 251). Other observations about the media and their impact on 
5 the culture environment have been made by Marshall McLuhan and Fred Allen. 
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EXAMPLES 

Example 1 : Generation of a vector for cardiac-specific SEAP expression 

Using standard cloning procedures, a SEAP expression vector was obtained in the following 
5 way. A 5.4 kb fragment containing the promoter region of the a-mouse myosin heavy chain 
gene (Genbank No. U71441) was inserted into the polylinker of the pEGFP-1 vector 
(Clontech). The a-myosin heavy chain gene is specifically expressed by cardiac myocytes in 
differentiating ES cells (Gulick et al., J. Biol. Chem. 266 (1991), 9180-91855; Palermo et al., 
Cell. Mol. Biol. Res. 41 (1995), 501-519; Morkin, Microsc. Res. Tech. 50 (2000), 522-531; 
10 Boheler et al., Circ. Res. 91 (2002), 189-201). Subsequently, the EGFP encoding sequence 
was removed and replaced by a SEAP encoding sequence (Berger et al., Gene 66 (1988), 1- 
10). 

The resulting vector, designated a-MHC-SEAP, thus contains the SEAP gene under the 
transcriptional control of a heart-specific promoter. Further features of the vector include the 
presence of a Kozak consensus sequence to allow efficient translation of the SEAP mRNA, 
the presence of SV40 polyadenylation signals directing proper processing of the SEAP 
mRNA, the presence of a neomycin resistance cassette consisting of the SV40 early promoter, 
the neomycin/kanamycin resistance gene of Tn5 and polyadenylation signals from the Herpes 
simplex virus thymidine kinase gene to allow selection of stably transfected ES cells, and the 
presence of a bacterial promoter upstream of this cassette to confer kanamycin resistance in E. 
coli. 

Example 2: Generation of ES cell transfectants 

25 The a-MHC-SEAP vector was linearized by restriction endonuclease digestion and 
introduced into D3 mouse embryonic stem (ES) cells (American Type Culture Collection) by 
electroporation. Following neomycin selection, single ES cell colonies were isolated and 
expanded. ES cell culture and transfection as well as selection of stable transfectants was 
done using standard procedures (Torres and Ktihn, Laboratory protocols for conditional gene 

30 targeting. Oxford University Press, (1997) New York). 



43 



Example 3: Differentiation of ES cells into cardiac cells in the presence of retinoic acid 

The presence of excessive retinoic acid has striking effects on early heart development in vivo 
(Osmond et al., Development 113 (1991), 1405-1417; Yutzey et al., Development 120 (1994), 
871-883; Dickman and Smith, Dev. Dyn. 206 (1996), 39-48; Drysdale et al, Dev. Biol. 188 
5 (1997), 205-215). Furthermore, all-trans retinoic acid (CAS No. 302-79-4) was shown to 
inhibit the generation of cardiomyocytes from ES cells in vitro (Seiler et al. ALTEX 19 
Suppl. 1 (2002), 55-63). Therefore, ES cells stably transfected with the a-MHC-SEAP vector 
were treated with retinoic acid at different concentrations and subjected to a differentiation 
protocol that resulted in the generation of cardiac cells. If secreted reporter, here SEAP, 
10 activity can indeed be used to quantify ES cell-derived cardiomyocytes, the amount of SEAP 
activity in culture supernatants should correlate inversely with the concentrations of retinoic 
acid present in the cultures. 

All incubations of cell cultures were performed at 37°C in a humidified atmosphere 
containing 5% CO2. 

15 

ES cell aggregation 

ES cells stably transfected with the a-MHC-SEAP vector (see Example 2) were trypsinised to 
obtain a single cell suspension. Cells were collected by centrifugation (800 g for 5 min) and 
resuspended in Iscove's Modified Dulbecco's Medium (IMDM, Invitrogen) supplemented 

20 with 20% (v/v) fetal bovine serum (FBS, Invitrogen) at a density of 2xl0 6 cells/ml. To 
aggregate ES cells, 4 ml of this suspension were incubated on a 6 cm bacteriological dish 
(Greiner) on a rocking platform (GFL 3006) with agitation (50 rpm). After 6 hours, 2 ml of 
that culture were added to 13 ml IMDM containing 20% FBS in a T25 flask (Falcon). The 
flask was incubated with rocking (50 rpm; GFL 3006) for 18 hours. During this procedure ES 

25 cell aggregates (embryoid bodies; EBs) were formed. 

ES cell differentiation in the presence of retinoic acid 

All-trans retinoic acid (Sigma) was dissolved in dimethyl sulfoxide (DMSO, Sigma) to obtain 
10" 4 M, 10" 5 M, 10" 6 M, 10" 17 M, 10" 8 M, and 10" 9 M stock solutions. In 10 cm petri dishes 
30 (bacteriological grade, Greiner) medium was prepared consisting of IMDM supplemented 
with 20% FBS and retinoic acid added from above stocks to yield final concentrations of 10" 7 
M, 10" 8 M, 10" 9 M, 10~ 10 M, 10" u M, and 10' 12 M retinoic acid, respectively. One dish 
received DMSO (final concentration 0.1% (v/v)) without retinoic acid (solvent control). Each 
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dish contained 10 ml of medium. 70 EBs were transferred into each petri dish and incubated 
for 4 days. 

Subsequently, EBs were transferred to 6-well plates (tissue culture grade; Becton-Dickinson). 
5 Each well contained 3 ml EMDM with 20% FBS and retinoic acid and DMSO as described 
above for the culture in 10 cm dishes. 2 wells were set up for each concentration of retinoic 
acid and 2 wells as solvent controls. To each well 20 EBs were transferred from the respective 
10 cm dish. Plates were incubated for 7 days. Subsequently, culture medium was exchanged 
with 2 ml knockout™ D-MEM (Invitrogen) supplemented with 20% (v/v) knockout™ SR 
10 (Invitrogen). Samples of cell culture supernatants were collected after incubation of EBs for 
further two days. 

Determination of SEAP activity 

Samples were assayed for SEAP activity using the Great EscAPe SEAP Fluorescence 
15 Detection Kit (Becton-Dickinson) following the manufacturer's instructions. Fluorescence 
was measured using a microplate reader (Safire, Tecan). Background activity was determined 
by assaying a sample of fresh medium (knockout™ D-MEM supplemented with 20% (v/v) 
knockout™ SR). 

20 Results are shown in Fig. 1. Concentrations of retinoic acid below 10" 9 M had no influence on 
SEAP activity as compared to controls. SEAP activity decreased sharply at 10" 8 M retinoic 
acid. At higher concentrations no activity was observed. 

Thus, a concentration-dependent effect of retinoic acid on SEAP activity in culture 
25 supernatants is clearly observed. Moreover, these data are consistent with the concentration- 
dependent inhibition of cardiomyocyte generation by retinoic acid described previously 
(Seiler et al, 2002, supra). Therefore, the results indicate the suitability of the reporter system 
presented here to quantify ES cell derived cardiomyocytes and its usefulness for the 
monitoring of the differentiation process of ES cells 

30 
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SEQUENCE LISTING 

<110> Axiogenesis AG 

<120> Secreted Proteins As Markers For Cell Differentiation 

<130> AX02A03/P-US 

<160> 4 

<170> Patentln version 3.1 

<210> 1 

<211> 5443 

<212> DNA 

<213> Mus musculus 

<220> 

<221> promoter 

<222> (1) . . (5443) 

<223> alpha myosin heavy chain gene, promoter region 
<220> 

<221> promoter 

<222> (2) . . (5442) 

<223> promoter region contained in SEQ ID NO 3 
<220> 

<221> 5'UTR 

<222> (4369) . . (4389) 

<223> exonl 
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<220> 
5 <221> 5'UTR 

<222> (5071) . . (5139) 
<223> exon 2 

10 

<220> 
15 <221> 5'UTR 

<222> (5431) . . (5443) 
<223> exon 3 

20 



25 



30 



35 



40 



45 



50 



55 



60 



<400> 1 
ggatcctgca 


aggtcacaca 


agggtctcca 


cccaccaggt 


gccctagtct 


caatttcagt 


60 


ttccatgcct 


tgttctcaca 


atgctggcct 


ccccagagct 


aatttggact 


ttgtttttat 


120 


ttcaaaaggg 


cctgaatgag 


gagtagatct 


tgtgctaccc 


agctctaagg 


gtgcccgtga 


180 


agccctcaga 


cctggagcct 


ttgcaacagc 


cctttaggtg 


gaagcagaat 


aaagcaattt 


240 


tccttaaagc 


caaaatcctg 


cctctagact 


cttcttctct 


gacctcggtc 


cctgggctct 


300 


agggtgggga 


ggtggggctt 


ggaagaagaa ggtggggaag 


tggcaaaagc 


cgatccctag 


360 


ggccctgtga 


agttcggagc 


cttccctgta 


cagcactggc 


tcatagatcc 


tcctccagcc 


420 


aaacatagca 


agaagtgata 


cctcctttgt 


gacttcccca 


ggcccagtac 


ctgtcaggtt 


480 


gaaacaggat 


ttagagaagc 


ctctgaactc 


acctgaactc 


tgaagctcat 


ccaccaagca 


540 


agcacctagg 


tgccactgct 


agttagtatc 


ctacgctgat 


aatatgcaga 


gctgggccac 


600 


agaagtcctg 


gggtgtagga 


actgaccagt 


gacttttcag 


tcggcaaagg 


tatgaccccc 


660 


tcagcagatg 


tagtaatgtc 


cccttagatc 


ccatcccagg 


caggtctcta 


agaggacatg 


720 


ggatgagaga 


tgtagtcatg 


tggcattcca 


aacacagcta 


tccacagtgt 


cccttgcccc 


780 


ttccacttag 


ccaggaggac 


agtaacctta gcctatcttt 


cttcctcccc 


atcctcccag 


840 


gacacacccc 


ctggtctgca 


gtattcattt 


cttccttcac 


gtcccctctg 


tgacttccat 


900 


ttgcaaggct 


tttgacctct 


gcagctgctg gaagatagag 


tttggcccta 


ggtgtggcaa 


960 


gccatctcaa 


gagaaagcag 


acaacagggg 


gaccagattt 


tggaaggatc 


aggaactaaa 


1020 


tcactggcgg 


gcctgggggt 


agaaaaaaga gtgagtgagt 


ccgctccagc 


taagccaagc 


1080 


tagtccccga 


gatactctgc 


cacagctggg 


ctgctcgggg 


tagctttagg 


aatgtgggtc 


1140 


tgaaagacaa 


tgggattgga 


agacatctct 


ttgagtctcc 


cctcaacccc 


acctacagac 


1200 



acactcgtgt gtggccagac tcctgttcaa 
gcaaccagag catgggccct gtgctgagga 
5 caggggaggg agaacagaga acgaaataag 
tgcagtcaga agagatggga agccaacaca 
agattctggg cataaggagg ccacagaaag 

10 

ataccctcat cccgtctccc aattaagccc 
agcgaagaga cccgtaggga ggatcacact 
15 gcaacctaag agccagagcc taaaagagca 
ctgtgcacac agagggtcga ggactggtgg 
atgggcgggg ggggggattc tggggggggg 

20 

agaatctgga agcgctggaa acgataccat 
taaatcatga aagacaggga gaagggaagc 
25 catggaagca aggacaagcc aggttgagcg 
ccctggtatg agcaccagaa cagcagaggc 
gtagacacag gaacagacag agacggggga 

30 

ctgtggccag agcgtccatc tgtgtccaca 
tggcttggga ggcagctgga aagagtatgt 
35 aaggaagaag agggcaaacc aggccacaca 
cttccttgtt gcatcttcca taggaggcag 
gcccccacta cccataccaa gtttggcctg 

40 

cctggccttt gtctcttgga cctgacccaa 
gccctcaaga gcttgagaac caggcagtga 
45 tgcacacagg agcctcaagt gacctccagg 
ccaaagagca accatttggc ataggtggct 
ccttcaagaa tactgcatgc aagacctaag 

50 

ccacccacca taaggggagt gaactatcct 
attactgaga gtgctgagcc cagaaaaact 
55 tctagagcta tattgagagg tgacagtaga 
tcctgggtgt gagggtgtag gggaaagcca 
cacaatgtct acttagttat aacaggcatg 

60 

tgaaaagaca gcagccctgg aggacagggg 
ccacaaagga gggcatgagt gtgagtataa 
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cagccctctg 


tgttctgacc 


actgagctag 


1260 


tgaagagttg 


gttaccaata 


gcaaaaacag 


1320 


gaaggaagaa 


ggaaaggcca 


gtcaatcaga 


1380 


cagcttgagc 


agaggaaaca 


gaaaagggag 


1440 


aagagcccag 


gccccccaag 


tctcctcttt 


1500 


actcttcttc 


ctagatcaga 


cctgagctgc 


1560 


ggatgaagga 


gatgtgtgga 


gaagtccagg 


1620 


agagataaag gtgcttcaaa 


ggtggccagg 


1680 


tagagcctca 


agataaggat 


gatgctcaga 


1740 


agagagaagg 


tgagaaggag 


cctggaacag 


1800 


aaagggaaga 


acccaggcta 


cctttagatg 


1860 


tggagagagt 


agaaggaccc 


cggggcaaga 


1920 


ctccgtgaaa 


tcagcctgct 


gaaggcagag 


1980 


tagggttaat 


gtcgagacag 


ggaacagaag 


2040 


gccaggtaac 


aaaggaatgg 


tccttctcac 


2100 


tactctagaa 


tgttcatcag 


actgcagggc 


2160 


gagagccagg 


ggagacaagg 


gggcctagga 


2220 


agagggcaga 


gcccagaact 


gagttaactc 


2280 


tgggaactct 


gtgaccacca 


tcccccatga 


2340 


agtggcattc 


taggttccct 


gaggacagag 


2400 


gctgacccaa 


tgttctcagt 


accttatcat 


2460 


catattaggc 


catgggctaa 


ccctggagct 


2520 


gacacagctg 


cagacaggtg 


gcctttatcc 


2580 


gcaaatggga 


atgcaaggtt 


gaatcaggtc 


2640 


acccctggag 


agaggggtat 


gctcctgccc 


2700 


agggggctgg 


cgaccttggg 


gagacaccac 


2760 


gaccgccctg 


tgtcctgccc 


acctccacac 


2820 


tagggtggga 


gctggtagca 


gggagagtgt 


2880 


gagcagggga 


gtctggcttt 


gtctcctgaa 


2940 


acctgctaaa 


gacccaacat 


ctacgacctc 


3000 


ttgtctctga 


gccttgggtg 


cttgatggtg 


3060 


ggccccagga gcgttagaga 


agggcacttg 


3120 
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ggaaggggtc agtctgcaga gcccctatcc 
taaatctctg ggcctgccag gcattcaaag 

5 

ggcggaaggg agcaaccccc cacttatacc 
ctggccttcc cccttcccac ctcccatcag 
10 cctacatgtc caaacatcat ggtgcacgat 
ggaaatctgc aggcttaact gggttaatgt 
actgaaaacg ggcatggctg tgcagctgtt 

15 

ggtttgtgtg taaattgccc aaggcaaagt 
tggatgatgg cctgcatctc aaggaccatg 
20 ctaagctaag gtagcaaggt ctttggagga 
tacagacagt atctgtacag agtaaggaga 
tcaaagggaa actgagtcgt gcacctgcaa 

25 

tgtctctggg gagccagcac tgtggaactt 
gcctgaagct atgcagatag ccagggttga 
30 ttgtgtgttg gaggcagggg acagatatta 
ttgttcaact cacccttcag attaaaaata 
gtgtgagacg ctcctgtctc tcctctatct 

35 

gcccaaggac taaaaaaagg ccatggagcc 
gcaaaccttg gggccctgct gtcctcctgt 
40 ggagccagga caggagggaa gtgggaggga 
agcaggcata tgggatggga tataaagggg 
ccaacccagg taagagggag tttcgggtgg 

45 

acctagaagg aaactgcctt tcctggaagt 
gtggccttcc accagcctgg gaagttctca 
50 atgccccttc ccttacgctg tcttctccat 
ggtttatctt ggctcttcgt cttcagcaag 
tctactgtct ccgtgccttg ccttgccttc 
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acttcacctt ttctcccctt ctcatttgta 
cttccttcct tccttccttc ctttctccct 
60 cttccttcct gtgtcagagt gctgagaatc 
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4740 


ttcatccttc 


cttccttcct 


tccttccttc 


4800 


tccttccttc 


cttccttcct 


tccttccttc 


4860 


acacctgggg 


ttcccaccct 


tatgtaaaca 


4920 


ctgggtgctc 


tcttaccttc 


ctcaccccct 
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ggcttgtcct gttccatcct ggtcaggatc tctagattgg tctcccagcc tctgctactc 5040 

ctcttcctgc ctgttcctct ctctgtccag ctgcgccact gtggtgcctc gttccagctg 5100 

tggtccacat tcttcaggat tctctgaaaa gttaaccagg tgagaatgtt tcccctgtag 5160 

acagcagatc acgattctcc cggaagtcag gcttccagcc ctctctttct ctgcccagct 5220 

gcccggcact cttagcaaac ctcaggcacc cttaccccac atagacctct gacagagaag 52 80 

caggcacttt acatggagtc ctggtgggag agccataggc tacggtgtaa aagaggcagg 5340 

gaagtggtgg tgtaggaaag tcaggacttc acatagaagc ctagcccaca ccagaaatga 54 00 

15 cagacagatc cctcctatct cccccataag agtttgagtc gac 5443 

<210> 2 
20 <211> 6298 
<212> DNA 
<213> Mus musculus 

25 

<220> 

30 <221> promoter 

<222> (1) . . (5445) 

<223> ventricular myosin regulatory light chain gene (AF302688) 

35 

<220> 
40 <2 21> 5'UTR 

<222> (5446) . . (5486) 
<223> exon 1 

45 

<220> 
50 <221> exon 

<222> (5487) . . (5489) 
<223> exon 1 

55 

<220> 
60 <221> exon 

<222> (6209) . - (6298) 



<223> exon 2 

5 <400> 2 

ctcgagtgct gggattaaag gcatgcacca 

ttaatctttt taattatgtg tatgtatgtc 

10 tgccctggaa gaccagaagg gggtgacaaa 

tgctgggaat caaacttaat ttctctagaa 

ttgccagccc ttattttgtt tggttggttg 

15 

agtcttttat actctgccct catggcttct 
ttatcttcag aaagcaattg ttactttgac 
20 ctgaggctag aagaaaatgt accctccttc 
caggtaaagg tagcaggtcc aggttcagat 
cttcacttcc aagcttgccc cacctgatca 

25 

gcaggctctt caccccaaca ggagtcccca 
ccttatgaca accctgcgaa taaaatcgtc 
30 aagctgagag caaaaaccta cctgatgtca 
tcaaaagtga atattcccgc aggctccagt 
acatttgggt ccatgttctg aaatcctctg 

35 

ctaaatagct gagtccattc acctggccct 
ttcaggattc ttcaaacttg ttctgagtag 
40 ctcagcagaa agcccccatg tggcacaggc 
tcacaactgc acttcctctc atgccgaaga 
ggaaccatcg gatcctgtaa cccgcttctc 

45 

ctccctccgt ctgctctctt ctgcactgca 
gtttttgttt ggcttgtttc tttgcttgcc 
50 tagcaaaggc atttagtggg tcaaccaagc 
caaggtcatc ctttgctaca caatgagttc 
ctaaaataaa caaaaaataa aataattgat 

55 

tagactgttg gggatacatt ctccaagagt 
tgttgttact ataacacaac actagagtct 
60 tgggctggag agatggctca gagctgactg 
gcaaccacat ggtggctcga aaccatctgt 
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actcaggaga 
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ggtcagaatt 


1380 


agggtcagcc 


tgggctacac 


gagatccaat 


1440 


atttcttctc 


tgtatattat 


agttatactg 


1500 


ctcaaagttc 


atctccactg 


agttaattaa 


1560 


ggttcgttta 


caaatacaaa 


agatttattt 


1620 


ctcttccaaa 


ggtcctgagt 


tccaatccca 


1680 


aatgggatct 


gatgecttet 


tctagtgtgt 


1740 



ctgaagacag ctatggtgta ctcacataca 
atttgtttct acaaaggcta gatgctactg 
5 cttcttacta actcactaat cccatcttgg 
ctaacctcgt tcacctctcc tagaccttac 
atcaaatttt caacccagga gctttaggga 

10 

cccggtgtac actggatcct gcacacagta 
atgggggcaa aatcattttt gtgtttataa 
15 aattgtcaga gtaatttctc tctctctctc 
tgtgtctgtg tgtgtgtctg tgtctgtgtc 
agtgggtttg ggtacctgtc atgtggacat 

20 

ttgtgtgtca tttctcagag gtgatccatc 
gccggggact tactgttttt aggttacacc 
25 agacggtctg tctccatctc tccagcgaca 
tttgatgtgg gtctgggttt tgaggagtag 
cagccactga acataattca tatataatct 

30 

tgttttgttt tgttctttgc tgtgtattat 
ctacctggga gggacactga acagccagaa 
35 gtgagcccgg gaatgctggc agctgccggg 
ccgcatccac ggtggacgca gccctctgcc 
aggtagcgag cttggctgct ctgagctgcc 

40 

tgggaaagcg tgatcagctt gctttgttct 
ccaggactcc tgagtctctg ctccgtggta 
45 cagaaatctc acaagcgcat gagcttagag 
ggacaaagca ttgacagctc ctgtgtgcgc 
caggaactgt gagaggagtg cagagcccct 

50 

aggacccttg ggtttttaac accaaatcac 
gctcttcctc atacagcacg gtgggccagt 
55 ccaggctcac ggtaaactct aacctcaatc 
cttgcttctc agccaccgtg tggcacttgg 
gccaagggaa taggggcttt gcttaactgg 

60 

ctcactatgt agccaaggac gatcataaac 
gttgagataa caggtgtggg tcactcccta 
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1860 


gggggggagg 


gggaagcttc 


atgacttcac 
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gacaggttac 


ctctgtgggc 
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caagtcacgt 


gtgcctcaac 
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tcttgatcct 


cccgcttcag 


agtcctgggt 
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cccttcttct 


aatagcaatc 


aatgtgtggc 


3660 
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5759 


ctgatctcaa 


gaaatcagta 
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ccaggaagag 
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tctctctttt 


gaatattccc 


cttgaaatat 
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ctcccagaac 
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aggtagcccc 


cagctgctag 
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ata gaa ggc ggg age tec aac gtg ttc tec atg ttt gag cag acc cag 62 80 

lie Glu Gly Gly Ser Ser Asn Val Phe Ser Met Phe Glu Gin Thr Gin 
10 15 20 25 

ate cag gag ttc aag gaa 62 98 

lie Gin Glu Phe Lys Glu 
30 

<210> 3 
<211> 10475 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> composite vector 
<220> 

<221> promoter 

<222> (77) . . (5517) 

<223> 77-5517 Mus musculus alpha myosin heavy chain gene, promoter reg 
on (Acc. No. U71441) 
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<220> 

<221> CDS 
5 <222> (5558) . . (7117) 
<223> SEAP 

10 

<220> 

<221> rep_origin 
15 <222> (9750) (10393) 

<223> pUC plasmid replication origin 

20 

<220> 

<221> polyA_signal 
25 <222> (9401) . . (9406) 

<223> Herpes simplex virus thymidine kinase polyadenylation signal 

30 

<220> 

<221> polyA_signal 
35 <222> (9414) . . (9419) 

<22 3> Herpes simplex virus thymidine kinase polyadenylation signal 

40 

<220> 

<221> terminator 
45 <222> (9163) .. (9165) 

<223> Kanamycin/neomycin resistance gene 

50 

<220> 

<221> ATG 
55 <222> (8371) . . (8373) 

<223> Kanamycin/neomycin resistance gene 
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<220> 



<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 
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promoter 
(8242) . . (8248) 

SV4 0 early promoter element 



repeat_region 

(8167) . . (8230) 

SV4 0 early promoter 



enhancer 

(8018) . . (8163) 

SV4 0 early promoter 



rep_origin 
(8187) . . (8322) 
SV40 origin of replication 



-10_signal 
(7931) . . (7936) 

Ampicillin resistance (b- lactamase) promoter 



-35_signal 
(7908) . . (7913) 

Ampicillin resistance (b-lactamase) promoter 



<220> 
<221> 



polyA_signal 



<222> 
<223> 
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(7294) . . (7299) 

SV4 0 early mRNA polyadenylation signal 
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<220> 
<221> 

10 

<222> 
<223> 

15 

<220> 
<221> 

20 

<222> 
<223> 

25 

<220> 
<221> 

30 

<222> 
<223> 

35 

<220> 
<221> 

40 

<222> 
<223> 

45 

<400> 



polyA_signal 
(7323) . . (7328) 

SV4 0 early mRNA polyadenylation signal 

mature protein 
(5609) . . (7114) 
SEAP 

start codon 
(5558) . . (5560) 

ATG of the Secreted alkaline phosphatase (SEAP) gene 

sig_peptide 
(5558) . . (5608) 
SEAP 



tgg gctctagggt ggggaggtgg ggcttggaag aagaaggtgg ggaagtggca 420 
ate cctagggccc tgtgaagttc ggagccttcc ctgtacagca ctggctcata 480 



60 



50 acggtaccgc gggcccgatc ctgcaaggtc acacaagggt ctccacccac caggtgccct 



120 



agtctcaatt tcagtttcca tgccttgttc teacaatget ggcctcccca gagctaattt 



180 



ggactttgtt tttatttcaa aagggcctga atgaggagta gatcttgtgc tacccagctc 



240 



taagggtgcc cgtgaagccc tcagacctgg agectttgea acagcccttt aggtggaagc 



300 



agaataaagc aattttcctt aaagccaaaa tcctgcctct agactcttct tctctgacct 



360 



gatcctcctc cagccaaaca tagcaagaag 
agtacctgtc aggttgaaac aggatttaga 
5 ctcatccacc aagcaagcac ctaggtgcca 
gcagagctgg gccacagaag tcctggggtg 
aaaggtatga ccccctcagc agatgtagta 

10 

ctctaagagg acatgggatg agagatgtag 
agtgtccctt gccccttcca cttagccagg 
15 tccccatcct cccaggacac accccctggt 
ctctgtgact tccatttgca aggcttttga 
ccctaggtgt ggcaagccat ctcaagagaa 

20 

ggatcaggaa ctaaatcact ggcgggcctg 
ccagctaagc caagctagtc cccgagatac 
25 ttaggaatgt gggtctgaaa gacaatggga 
accccaccta cagacacact cgtgtgtggc 
tgaccactga gctaggcaac cagagcatgg 

30 

caatagcaaa aacagcaggg gagggagaac 
ggccagtcaa tcagatgcag tcagaagaga 
35 aaacagaaaa gggagagatt ctgggcataa 
ccaagtctcc tctttatacc ctcatcccgt 
tcagacctga gctgcagcga agagacccgt 

40 

gtggagaagt ccagggcaac ctaagagcca 
tcaaaggtgg ccaggctgtg cacacagagg 
45 aggatgatgc tcagaatggg cggggggggg 
aggagcctgg aacagagaat ctggaagcgc 
ggctaccttt agatgtaaat catgaaagac 

50 

gaccccgggg caagacatgg aagcaaggac 
ctgctgaagg cagagccctg gtatgagcac 
55 gacagggaac agaaggtaga cacaggaaca 
aatggtcctt ctcacctgtg gccagagcgt 
atcagactgc agggctggct tgggaggcag 

60 

caagggggcc taggaaagga agaagagggc 
gaactgagtt aactccttcc ttgttgcatc 
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tgatacctcc 


tttgtgactt 


ccccaggccc 


540 


gaagcctctg 


aactcacctg 


aactctgaag 


600 


ctgctagtta 


gtatcctacg 


ctgataatat 


660 


taggaactga 


ccagtgactt 


ttcagtcggc 


720 


atgtcccctt 


agatcccatc 


ccaggcaggt 


780 


tcatgtggca 


ttccaaacac 


agctatccac 


840 


aggacagtaa 


ccttagccta 


tctttcttcc 


900 


ctgcagtatt 


catttcttcc 


ttcacgtccc 


960 


cctctgcagc 


tgctggaaga 


tagagtttgg 


1020 


agcagacaac 


agggggacca 


gattttggaa 


1080 


99ggtagaaa 


aaagagtgag 


tgagtccgct 


1140 


tctgccacag 


ctgggctgct 


cggggtagct 


1200 


ttggaagaca 


tctctttgag 


tctcccctca 


1260 


cagactcctg 


ttcaacagcc 


ctctgtgttc 


1320 


gccctgtgct 


gaggatgaag 


agttggttac 


1380 


agagaacgaa 


ataaggaagg 


aagaaggaaa 


1440 


tgggaagcca 


acacacagct 


tgagcagagg 


1500 


ggaggccaca 


gaaagaagag 


cccaggcccc 


1560 


ctcccaatta 


agcccactct 


tcttcctaga 


1620 


agggaggatc 


acactggatg 


aaggagatgt 


1680 


gagcctaaaa 


gagcaagaga 


taaaggtgct 


1740 


gtcgaggact 


ggtggtagag 


cctcaagata 


1800 


gattctgggg 


gggggagaga 


gaaggtgaga 


1860 


tggaaacgat 


accataaagg 


gaagaaccca 


1920 


agggagaagg 


gaagctggag 


agagtagaag 


1980 


aagccaggtt 


gagcgctccg 


tgaaatcagc 


2040 


cagaacagca 


gaggctaggg 


ttaatgtcga 


2100 


gacagagacg 


ggggagccag 


gtaacaaagg 


2160 


ccatctgtgt 


ccacatactc 


tagaatgttc 


2220 


ctggaaagag 


tatgtgagag 


ccaggggaga 


2280 


aaaccaggcc 


acacaagagg 


gcagagccca 


2340 


ttccatagga 


ggcagtggga 


actctgtgac 


2400 
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caccatcccc catgagcccc cactacccat 
tccctgagga cagagcctgg cctttgtctc 

5 

tcagtacctt atcatgccct caagagcttg 
gctaaccctg gagcttgcac acaggagcct 
10 aggtggcctt tatccccaaa gagcaaccat 
aggttgaatc aggtcccttc aagaatactg 
ggtatgctcc tgcccccacc caccataagg 

15 

ttggggagac accacattac tgagagtgct 
tgcccacctc cacactctag agctatattg 
20 tagcagggag agtgttcctg ggtgtgaggg 
gctttgtctc ctgaacacaa tgtctactta 
aacatctacg acctctgaaa agacagcagc 

25 

gggtgcttga tggtgccaca aaggagggca 
agagaagggc acttgggaag gggtcagtct 
30 tggggccaac tggtgtaaat ctctgggcct 
ctggcagcct ggggaggcgg aagggagcaa 
ccaggattaa cacctctggc cttccccctt 

35 

gagggagggt aaaaacctac atgtccaaac 
tgtagaggca agaaaggaaa tctgcaggct 
40 atgtgtgtgt gtctgactga aaacgggcat 
ggttaccaga ctgcaggttt gtgtgtaaat 
atggtttaaa gagattggat gatggcctgc 

45 

actctatatg tgtctctaag ctaaggtagc 
gtgggcaaca gagactacag acagtatctg 
50 agaattctct tactatcaaa gggaaactga 
tagacatcat gactttgtct ctggggagcc 
aggaggctcc cctcagcctg aagctatgca 

55 

gcctgggatg ggagcttgtg tgttggaggc 
gtgaccctta cccagttgtt caactcaccc 
60 tgggtagggg aggtggtgtg agacgctcct 
ggggaggagg aatgtgccca aggactaaaa 



accaagtttg gcctgagtgg 


cat tctaggt 


2460 


ttggacctga 


cccaagctga 


cccaatgttc 


2520 


agaaccaggc 


agtgacatat 


taggccatgg 


2580 


caagtgacct 


ccagggacac 


agctgcagac 


2640 


ttggcatagg 


tggctgcaaa 


tgggaatgca 


2700 


catgcaagac 


ctaagacccc 


tacracraaaaa 


2760 


ggagtgaact 


atcctagggg 


acti act caacc 


2820 


gagcccagaa 


aaactgaccg 


ccctatatcc 


2 880 


agaggtgaca 


gtagataggg 


taaaaactQQ 
L yyy a y u y y 


2 940 


tgtaggggaa 


agccagagca 


yyy3 a r ^-y 


3000 


gttataacag 


gcatgacctg 


ctaaaaaccc 


3060 


cc tggaggac 


aggggttgtc 


i- i— i_y ay v~ l, l. i_ 


3120 


tgagtgtgag 


tataaggccc 


^^yy y y 


3180 


gcagagcccc 


tatccatgga 


a w t« y y c*. ^ ^ \* 


3240 


gccaggcatt 


caaagcagca 


rctacatcct 


3300 


ccccccactt 


ataccctttc 


tccctcaocc 


3360 


cccacctccc 


atcaggagtg 


y "■yy y l. 


3420 


atcatggtgc 


acgatatatg 




3480 


taactgggtt 


aatgtgtaaa 


crt"r , 't~crt~ai~ err* 


3540 


ggctgtgcag 


ctgttcagtt 


r , 1~cit'ac , at~aa 
\^ ^y y y ^ y ^ 


3600 


tgcccaaggc 


aaagtgggtg 


33■t - ^ , ^ , ^'t - ^~^"^■ 

ClU I— <w \_- ^. 1— ^ ^ 


3660 


atctcaagga 


ccatggaaaa 


tagaatggac 


3720 


aaggtctttg gaggacacct 


at cfcaaaaat 


3780 


tacagagtaa 


ggagagagag 


o a nncrnat" at" 
y ciyyyyy L.y i_ 


3840 


gtcgtgcacc 


tgcaaagtgg 


ciuyv. i«. \— ■ 


3900 


agcactgtgg 


aacttcaggt 


\»» u y ay ay ay l. 


3960 


gatagccagg 


gttgaaaggg 


ggaagggaga 


4020 


aggggacaga 


tattaagcct 


ggaagagaag 


4080 


ttcagattaa 


aaataactga 


ggtaagggee 


4140 


gtctctcctc 


tatctgccca 


tcggcccttt 


4200 


aaaggccatg 


gagccagagg 


ggegagggea 


4260 
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acagaccttt 


catgggcaaa 


ccttggggcc 


ctgctgtcct 


cctgtcacct 


ccagagccaa 


4320 




gggatcaaag 


gaggaggagc 


caggacagga 


gggaagtggg 


agggagggtc 


ccagcagagg 


4380 


5 


actccaaatt 


taggcagcag 


gcatatggga 


tgggatataa 


aggggctgga 


gcactgagag 


4440 




ctgtcagaga 


tttctccaac 


ccaggtaaga 


gggagtttcg 


ggtgggggct 


cttcacccac 


4500 


10 


accagacctc 


tccccaccta 


gaaggaaact 


gcctttcctg 


gaagtggggt 


tcaggccggt 


4560 


cagagatctg 


acagggtggc 


cttccaccag 


cctgggaagt 


tctcagtggc 


aggaggtttc 


4620 




cacaagaaac 


actggatgcc 


ccttccctta 


cgctgtcttc 


tccatcttcc 


tcctggggat 


4680 


15 


gctcctcccc 


gtcttggttt 


atcttggctc 


ttcgtcttca 


gcaagatttg 


ccctgtgctg 


4740 




tccactccat 


ctttctctac 


tgtctccgtg 


ccttgccttg 


ccttcttgcg 


tgtccttcct 


4800 


20 


ttccacccat 


ttctcacttc 


accttttctc 


cccttctcat 


ttgtattcat 


ccttccttcc 


4860 


ttccttcctt 


ccttccttcc 


ttccttcctt 


ccttcctttc 


tcccttcctt 


ccttccttcc 


4920 




ttccttcctt 


ccttccttcc 


ttcctgtgtc 


agagtgctga 


gaatcacacc 


tggggttccc 


4980 


25 


acccttatgt 


aaacaatctt 


ccagtgagcc 


acagcttcag 


tgctgctggg 


tgctctctta 


5040 




ccttcctcac 


cccctggctt 


gtcctgttcc 


atcctggtca 


ggatctctag 


attggtctcc 


5100 


30 


cagcctctgc 


tactcctctt 


cctgcctgtt 


cctctctctg 


tccagctgcg 


ccactgtggt 


5160 


gcctcgttcc 


agctgtggtc 


cacattcttc 


aggattctct 


gaaaagttaa 


ccaggtgaga 


5220 




atgtttcccc 


tgtagacagc 


agatcacgat 


tctcccggaa 


gtcaggcttc 


cagccctctc 


5280 


35 


tttctctgcc 


cagctgcccg 


gcactcttag 


caaacctcag 


gcacccttac 


cccacataga 


5340 




cctctgacag 


agaagcaggc 


actttacatg 


gagtcctggt 


gggagagcca 


taggctacgg 


5400 


40 


tgtaaaagag 


gcagggaagt 


ggtggtgtag 


gaaagtcagg 


acttcacata 


gaagcctagc 


5460 


ccacaccaga 


aatgacagac 


agatccctcc 


tatctccccc 


ataagagttt 


gagtcgaggg 


5520 




atctaagcag 


aagcttcgaa 


tcgcgaattc 


gcccacc atg ctg ctg ctg ctg ctg 


5575 



Met Leu Leu Leu Leu Leu 
45 15 

ctg ctg ggc ctg agg eta cag etc tec ctg ggc ate ate cca gtt gag 562 3 

Leu Leu Gly Leu Arg Leu Gin Leu Ser Leu Gly lie lie Pro Val Glu 
10 15 20 

50 

gag gag aac ccg gac ttc tgg aac cgc gag gca gec gag gec ctg ggt 5671 
Glu Glu Asn Pro Asp Phe Trp Asn Arg Glu Ala Ala Glu Ala Leu Gly 
25 30 35 

55 gec gee aag aag ctg cag cct gca cag aca gec gec aag aac etc ate 5719 
Ala Ala Lys Lys Leu Gin Pro Ala Gin Thr Ala Ala Lys Asn Leu lie 
40 45 50 

ate ttc ctg ggc gat ggg atg ggg gtg tct acg gtg aca get gee agg 5767 
60 lie Phe Leu Gly Asp Gly Met Gly Val Ser Thr Val Thr Ala Ala Arg 
55 60 65 70 

ate eta aaa ggg cag aag aag gac aaa ctg ggg cct gag ata ccc ctg 5815 
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He Leu Lys Gly Gin Lys Lys Asp Lys Leu Gly Pro Glu He Pro Leu 

75 80 85 

gcc atg gac cgc ttc cca tat gtg get ctg tec aag aca tac aat gta 

Ala Met Asp Arg Phe Pro Tyr Val Ala Leu Ser Lys Thr Tyr Asn Val 

90 95 100 

gac aaa cat gtg cca gac agt gga gcc aca gcc acg gcc tac ctg tgc 

Asp Lys His Val Pro Asp Ser Gly Ala Thr Ala Thr Ala Tyr Leu Cys 

105 110 115 

999 gtc aag ggc aac ttc cag acc att ggc ttg agt gca gcc gcc cgc 

Gly Val Lys Gly Asn Phe Gin Thr He Gly Leu Ser Ala Ala Ala Arg 

120 125 130 

ttt aac cag tgc aac acg aca cgc ggc aac gag gtc ate tec gtg atg 

Phe Asn Gin Cys Asn Thr Thr Arg Gly Asn Glu Val He Ser Val Met 

135 140 145 150 

aat egg gcc aag aaa gca ggg aag tea gtg gga gtg gta acc acc aca 

Asn Arg Ala Lys Lys Ala Gly Lys Ser Val Gly Val Val Thr Thr Thr 

155 160 165 

cga gtg cag cac gcc teg cca gcc ggc acc tac gcc cac acg gtg aac 

Arg Val Gin His Ala Ser Pro Ala Gly Thr Tyr Ala His Thr Val Asn 

170 175 180 

cgc aac tgg tac teg gac gcc gac gtg cct gcc teg gcc cgc cag gag 

Arg Asn Trp Tyr Ser Asp Ala Asp Val Pro Ala Ser Ala Arg Gin Glu 

185 190 195 

999 tgc cag gac ate get acg cag etc ate tec aac atg gac att gac 

Gly Cys Gin Asp He Ala Thr Gin Leu He Ser Asn Met Asp He Asp 

200 205 210 

gtg ate eta ggt gga ggc cga aag tac atg ttt cgc atg gga acc cca 

Val He Leu Gly Gly Gly Arg Lys Tyr Met Phe Arg Met Gly Thr Pro 

215 220 225 230 

gac cct gag tac cca gat gac tac age caa ggt ggg acc agg ctg gac 

Asp Pro Glu Tyr Pro Asp Asp Tyr Ser Gin Gly Gly Thr Arg Leu Asp 

235 240 245 

999 aag aat ctg gtg cag gaa tgg ctg gcg aag cgc cag ggt gcc egg 

Gly Lys Asn Leu Val Gin Glu Trp Leu Ala Lys Arg Gin Gly Ala Arg 

250 255 260 

tat gtg tgg aac cgc act gag etc atg cag get tec ctg gac ccg tct 
Tyr Val Trp Asn Arg Thr Glu Leu Met Gin Ala Ser Leu Asp Pro Ser 

265 270 275 

gtg acc cat etc atg ggt etc ttt gag cct gga gac atg aaa tac gag 

Val Thr His Leu Met Gly Leu Phe Glu Pro Gly Asp Met Lys Tyr Glu 

280 285 290 

ate cac cga gac tec aca ctg gac ccc tec ctg atg gag atg aca gag 
He His Arg Asp Ser Thr Leu Asp Pro Ser Leu Met Glu Met Thr Glu 

295 300 305 310 

get gcc ctg cgc ctg ctg age agg aac ccc cgc ggc ttc ttc etc ttc 
Ala Ala Leu Arg Leu Leu Ser Arg Asn Pro Arg Gly Phe Phe Leu Phe 

315 320 325 
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gtg gag ggt ggt cgc ate gac cat ggt cat cat gaa age agg get tac 
Val Glu Gly Gly Arg lie Asp His Gly His His Glu Ser Arg Ala Tyr 
330 335 340 

egg gca ctg act gag acg ate atg ttc gac gac gee att gag agg gcg 
Arg Ala Leu Thr Glu Thr lie Met Phe Asp Asp Ala lie Glu Arg Ala 
345 350 355 

ggc cag etc acc age gag gag gac acg ctg age etc gtc act gee gac 
Gly Gin Leu Thr Ser Glu Glu Asp Thr Leu Ser Leu Val Thr Ala Asp 
360 365 370 

cac tec cac gtc ttc tec ttc gga ggc tac ccc ctg cga ggg age tec 
His Ser His Val Phe Ser Phe Gly Gly Tyr Pro Leu Arg Gly Ser Ser 
375 380 385 390 

ate ttc ggg ctg gee cct ggc aag gee egg gac agg aag gec tac acg 
lie Phe Gly Leu Ala Pro Gly Lys Ala Arg Asp Arg Lys Ala Tyr Thr 
395 400 405 

gtc etc eta tac gga aac ggt cca ggc tat gtg etc aag gac ggc gee 
Val Leu Leu Tyr Gly Asn Gly Pro Gly Tyr Val Leu Lys Asp Gly Ala 
410 415 420 

egg ccg gat gtt acc gag age gag age ggg age ccc gag tat egg cag 
Arg Pro Asp Val Thr Glu Ser Glu Ser Gly Ser Pro Glu Tyr Arg Gin 
425 430 435 

cag tea gca gtg ccc ctg gac gaa gag acc cac gca ggc gag gac gtg 
Gin Ser Ala Val Pro Leu Asp Glu Glu Thr His Ala Gly Glu Asp Val 
440 445 450 

gcg gtg ttc gcg cgc ggc ccg cag gcg cac ctg gtt cac ggc gtg cag 
Ala Val Phe Ala Arg Gly Pro Gin Ala His Leu Val His Gly Val Gin 
455 460 465 470 

gag cag acc ttc ata gcg cac gtc atg gee ttc gec gee tgc ctg gag 
Glu Gin Thr Phe lie Ala His Val Met Ala Phe Ala Ala Cys Leu Glu 
475 480 485 

ccc tac acc gee tgc gac ctg gcg ccc ccc gec ggc acc acc gac gec 
Pro Tyr Thr Ala Cys Asp Leu Ala Pro Pro Ala Gly Thr Thr Asp Ala 
490 495 500 

gcg cac ccg ggt tac tct aga gtc ggg gcg gee ggc cgc ttc gag cag 
Ala His Pro Gly Tyr Ser Arg Val Gly Ala Ala Gly Arg Phe Glu Gin 
505 510 515 

aca tga taagatacat tgatgagttt ggacgcggcc gcgactctag atcataatca 
Thr 



gccataccac atttgtagag gttttacttg ctttaaaaaa cctcccacac ctccccctga 
acctgaaaca taaaatgaat gcaattgttg ttgttaactt gtttattgea gcttataatg 
gttacaaata aagcaatagc atcacaaatt tcacaaataa agcatttttt teactgeatt 
ctagttgtgg tttgtccaaa ctcatcaatg tatcttaagg cgtaaattgt aagcgttaat 
attttgttaa aattcgcgtt aaatttttgt taaatcagct cattttttaa ccaataggcc 
gaaateggea aaatccctta taaatcaaaa gaatagaccg agatagggtt gagtgttgtt 



ccagtttgga acaagagtcc actattaaag 
accgtctatc agggcgatgg cccactacgt 

5 

tcgaggtgcc gtaaagcact aaatcggaac 
cggggaaagc cggcgaacgt ggcgagaaag 
10 agggcgctgg caagtgtagc ggtcacgctg 
gcgccgctac agggcgcgtc aggtggcact 
tgtttatttt tctaaataca ttcaaatatg 

15 

atgcttcaat aatattgaaa aaggaagagt 
tgtgtcagtt agggtgtgga aagtccccag 
20 tgcatctcaa ttagtcagca accaggtgtg 
gtatgcaaag catgcatctc aattagtcag 
tcccgcccct aactccgccc agttccgccc 

25 

ttatttatgc agaggccgag gccgcctcgg 
gcttttttgg aggcctaggc ttttgcaaag 
30 cgcatgattg aacaagatgg attgcacgca 
ttcggctatg actgggcaca acagacaatc 
tcagcgcagg ggcgcccggt tctttttgtc 

35 

ctgcaagacg aggcagcgcg gctatcgtgg 
gtgctcgacg ttgtcactga agcgggaagg 
40 caggatctcc tgtcatctca ccttgctcct 
atgcggcggc tgcatacgct tgatccggct 
cgcatcgagc gagcacgtac tcggatggaa 

45 

gaagagcatc aggggctcgc gccagccgaa 
gacggcgagg atctcgtcgt gacccatggc 
50 aatggccgct tttctggatt catcgactgt 
gacatagcgt tggctacccg tgatattgct 
ttcctcgtgc tttacggtat cgccgctccc 

55 

cttgacgagt tcttctgagc gggactctgg 
acctgccatc acgagatttc gattccaccg 
60 tcgttttccg ggacgccggc tggatgatcc 
tcgcccaccc tagggggagg ctaactgaaa 
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aacgtggact 


ccaacgtcaa 


agggegaaaa 


7587 


gaaccatcac 


cctaatcaag 


ttttttqqqq 


7647 


cc taaaggga 


gcccccgat t 


tagagct tga 


7707 


gaagggaaga 


aagegaaagg 


aQCQQQCQCt 


7767 


cgcgtaacca 


ccacacccgc 


cgcgcttaat 


7827 


tttcaaaaaa 


atotacacQQ 


aacccctatt 


7887 


ta he cere tea 


tgagacaata 


accctgat aa 


7947 


cc tcf acrcicacr 


aaagaaccag 


c tgtggaatg 


8007 


*H * — ■ w w a. w 


aggcagaagt 


atgeaaagea 


8067 


gaaagt cccc 


aggc t cccca 


gcaggcagaa 


8127 


caacnataat 


cccacccct.ci 

W V- 1 1 *w v-» U wt 


actccaccca 


8187 


attctccgcc 


ccatggctga 


ctaatttttt 


8247 


cc tctgagc t 


at tccagaag 


taataaaaaa 

a 3J a 3 


8307 


at cgat caag 


agacaggatg 


aggat cgtt t 


8367 


aattctccaa 


ccacttaaat 


aaaaaaacta 

?3 ^ t * 


8427 


aac tact eta 


atgccgccgt 


gt t cegge tg 


8487 


aaaaccaacc 


tatccaatac 


cc tgaatgaa 


8547 


ctaaccacaa 


caoacat.t.cc 

^ ^ L- 1- W ■ 


t tgegcaget 


8607 




tattaaocaa 

«p ^ y *J w *H c*. 


•ay L y-^yyyy 


8667 




tatrcatcat 


aactoataca 

33 3 -J 


8727 




trcrarrarra 


aoroa aarat - 


8787 


accaat efc ta 

3 33 3 


t C03 t CaaOa 
^ Gl ^ w tA. CI 


taat ctaaac 


8847 


c tat t caeca 


aact caaaac 


gagcatgccc 


8907 


aatacctact 


taccaaatat 


cataataaaa 


8967 




nr t cr t era c an a 

y L y *-yy *-yy 


ecac tat caa 

^ l— Q- CI- vJ 


9027 


era act a nrt* t* ct 


crr , crcfc , cia ahn 

3 33 3 etc*, i— y 


era rfaafperr 


9087 


gattegcage 


gcatcgcctt 


ctatcgcctt 


9147 


ggttcgaaat 


gaccgaccaa 


gcgacgccca 


9207 


ccgccttcta 


tgaaaggttg 


ggcttcggaa 


9267 


tccagcgcgg 


ggatctcatg 


ctggagttct 


9327 


caeggaagga 


gacaataccg 


gaaggaaccc 


9387 



gcgctatgac ggcaataaaa agacagaata 
aacgcggggt tcggtcccag ggctggcact 
5 gccaatacgc ccgcgtttct tccttttccc 
cagggctcgc agccaacgtc ggggcggcag 
tactttagat tgatttaaaa cttcattttt 

10 

ttgataatct catgaccaaa atcccttaac 
ccgtagaaaa gatcaaagga tcttcttgag 
15 tgcaaacaaa aaaaccaccg ctaccagcgg 
ctctttttcc gaaggtaact ggcttcagca 
tgtagccgta gttaggccac cacttcaaga 

20 

tgctaatcct gttaccagtg gctgctgcca 
actcaagacg atagttaccg gataaggcgc 
25 cacagcccag cttggagcga acgacctaca 
gagaaagcgc cacgcttccc gaagggagaa 
tcggaacagg agagcgcacg agggagcttc 

30 

ctgtcgggtt tcgccacctc tgacttgagc 
ggagcctatg gaaaaacgcc agcaacgcgg 
35 cttttgctca catgttcttt cctgcgttat 
ccatgcat 

40 <210> 4 

<211> 519 

<212> PRT 

45 

<213> SEAP 
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aaacgcacgg 


tgttgggtcg 


tttgttcata 


9447 


ctgtcgatac 


cccaccgaga 


ccccattggg 


9507 


caccccaccc 


cccaagttcg 


ggtgaaggcc 


9567 


gccctgccat 


agcctcaggt 


tactcatata 


9627 


aatttaaaag 


gatctaggtg 


aagatccttt 


9687 


gtgagttttc 


gttccactga 


gcgtcagacc 


9747 


atcctttttt 


tctgcgcgta 


atctgctgct 


9807 


tggtttgttt 


gccggatcaa 


gagctaccaa 


9867 


gagcgcagat 


accaaatact 


gtccttctag 


9927 


actctgtagc 


accgcctaca 


tacctcgctc 


9987 


gtggcgataa 


gtcgtgtctt 


accgggttgg 


10047 


agcggtcggg 


ctgaacgggg 


ggttcgtgca 


10107 


ccgaactgag 


atacctacag 


cgtgagctat 


10167 


aggcggacag 


gtatccggta 


agcggcaggg 


10227 


cagggggaaa 


cgcctggtat 


ctttatagtc 


10287 


gtcgattttt 


gtgatgctcg 


tcaggggggc 


10347 


cctttttacg 


gttcctggcc 


ttttgctggc 


10407 


cccctgattc 


tgtggataac 


cgtattaccg 


10467 



10475 



50 

<400> 4 

Met Leu Leu Leu Leu Leu Leu Leu Gly Leu Arg Leu Gin Leu Ser Leu 
15 10 15 

55 



Gly lie lie Pro Val Glu Glu Glu Asn Pro Asp Phe Trp Asn Arg Glu 
20 25 30 
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Ala Ala Glu Ala Leu Gly Ala Ala Lys Lys Leu Gin Pro Ala Gin Thr 
35 40 45 
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Ala Ala Lys Asn Leu He He Phe Leu Gly Asp Gly Met Gly Val Ser 
50 55 60 



Thr Val Thr Ala Ala Arg He Leu Lys Gly Gin Lys Lys Asp Lys Leu 
65 70 75 80 



Gly Pro Glu He Pro Leu Ala Met Asp Arg Phe Pro Tyr Val Ala Leu 
85 90 95 



Ser Lys Thr Tyr Asn Val Asp Lys His Val Pro Asp Ser Gly Ala Thr 
100 105 110 



Ala Thr Ala Tyr Leu Cys Gly Val Lys Gly Asn Phe Gin Thr He Gly 
115 120 125 



Leu Ser Ala Ala Ala Arg Phe Asn Gin Cys Asn Thr Thr Arg Gly Asn 
130 135 140 



Glu Val lie Ser Val Met Asn Arg Ala Lys Lys Ala Gly Lys Ser Val 
145 150 155 160 



Gly Val Val Thr Thr Thr Arg Val Gin His Ala Ser Pro Ala Gly Thr 
165 170 175 



Tyr Ala His Thr Val Asn Arg Asn Trp Tyr Ser Asp Ala Asp Val Pro 
180 185 190 



Ala Ser Ala Arg Gin Glu Gly Cys Gin Asp He Ala Thr Gin Leu He 
195 200 205 



Ser Asn Met Asp He Asp Val He Leu Gly Gly Gly Arg Lys Tyr Met 
210 215 220 



Phe Arg Met Gly Thr Pro Asp Pro Glu Tyr Pro Asp Asp Tyr Ser Gin 
225 230 235 240 



Gly Gly Thr Arg Leu Asp Gly Lys Asn Leu Val Gin Glu Trp Leu Ala 
245 250 255 



Lys Arg Gin Gly Ala Arg Tyr Val Trp Asn Arg Thr Glu Leu Met Gin 
260 265 270 



Ala Ser Leu Asp Pro Ser Val Thr His Leu Met Gly Leu Phe Glu Pro 
275 280 285 



Gly Asp Met Lys Tyr Glu He His Arg Asp Ser Thr Leu Asp Pro Ser 
290 295 300 
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Leu Met Glu Met Thr Glu Ala Ala Leu Arg Leu Leu Ser Arg Asn Pro 
305 310 315 320 



Arg Gly Phe Phe Leu Phe Val Glu Gly Gly Arg He Asp His Gly His 
325 330 335 



His Glu Ser Arg Ala Tyr Arg Ala Leu Thr Glu Thr He Met Phe Asp 
340 345 350 



Asp Ala He Glu Arg Ala Gly Gin Leu Thr Ser Glu Glu Asp Thr Leu 
355 360 365 



Ser Leu Val Thr Ala Asp His Ser His Val Phe Ser Phe Gly Gly Tyr 
370 375 380 



Pro Leu Arg Gly Ser Ser He Phe Gly Leu Ala Pro Gly Lys Ala Arg 
385 390 395 400 



Asp Arg Lys Ala Tyr Thr Val Leu Leu Tyr Gly Asn Gly Pro Gly Tyr 
405 410 415 



Val Leu Lys Asp Gly Ala Arg Pro Asp Val Thr Glu Ser Glu Ser Gly 
420 425 430 



Ser Pro Glu Tyr Arg Gin Gin Ser Ala Val Pro Leu Asp Glu Glu Thr 
435 440 445 



His Ala Gly Glu Asp Val Ala Val Phe Ala Arg Gly Pro Gin Ala His 
450 455 460 



Leu Val His Gly Val Gin Glu Gin Thr Phe He Ala His Val Met Ala 
465 470 475 480 



Phe Ala Ala Cys Leu Glu Pro Tyr Thr Ala Cys Asp Leu Ala Pro Pro 
485 490 495 



Ala Gly Thr Thr Asp Ala Ala His Pro Gly Tyr Ser Arg Val Gly Ala 
500 505 510 



Ala Gly Arg Phe Glu Gin Thr 
515 
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Claims 

A method of monitoring cell differentiation comprising: 

(a) culturing cells capable of differentiating into at least one particular cell type 
containing at least one recombinant nucleic acid molecule comprising a 
reporter gene encoding a product that is secreted upon cell differentiation, or 
maintaining a non-human animal comprising such cells, under conditions 
allowing differentiation of the cells; and 

(b) determining the amount or activity of the reporter gene product either within a 
body fluid of said transgenic non-human animal or the cell culture medium. 

The method of claim 1, wherein said recombinant nucleic acid molecule comprises 
at least one cell type specific regulatory sequence operably linked to said reporter 
gene. 

The method of claim 1 or 2, wherein said cells are or are derived from stem cells 
or primordial germ cells, preferably human or rodent stem cells or primordial germ 
cells. 

The method of claim 3, wherein said stem cells are embryonic stem cells or 
multipotent adult progenitor cells (MAPCs). 

The method of any one of claims 1 to 4, wherein said reporter gene product 
comprises a secretory leader sequence. 

The method of any one of claims 2 to 5, wherein said regulatory sequence 
comprises a promoter and/or enhancer elements. 

The method of any one of claims 1 to 6, wherein said cell type is selected from the 
group consisting of connecting fibroblasts, stromal cells, endothelial cells, glial 
cells, neural cells, neuronal cells, hematopoietic cells, smooth muscle cells, 
skeletal muscle cells, epithelial cells, and cardiac cells. 
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The method of claim 6 or 7, wherein said promoter or enhancer is selected from 
the group consisting of aMHC, MLC2V, VE-cadherin, Tie-2, Flk-1, Flt-1, GFAP, 
alpha- 1 -tubulin and collagen 2 promoter or enhancer. 

The method of any of claims 1 to 8, wherein said reporter gene product is secreted 
alkaline phosphatase (SEAP) or alpha-amylase. 

The method of any one of claims 1 to 9, wherein said recombinant nucleic acid 
molecule further comprises a selectable marker expressed by multi- or pluripotent 
cells. 

The method of any one of claims 1 to 10, wherein said cells form cell aggregates 
or tissue-like aggregates derived from different cell types. 

The method of any one of claims 1 to 1 1, wherein said cells form embryoid bodies 
(EBs). 

A reporter gene construct for monitoring cell differentiation comprising a 
recombinant nucleic acid molecule as defined in any one of claims 1 to 12. 

A cell as defined in any one of claims 1 to 12 or comprising a reporter gene 
construct of claim 13, wherein said cell is capable of differentiating into at least 
one particular cell type. 

A cell aggregate of at least one cell type obtainable by the method of any one of 
claims 1 to 12. 

A tissue obtainable by the method of any one of claims 1 to 12 or comprising cells 
of claim 14 or a cell aggregate of claim 15. 

An organ comprising tissue of claim 16, cells of claim 14 or a cell aggregate of 
claim 15. 
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An implant or transplant comprising an organ of claim 17, tissue of claim 16, cells 
of claim 14 or a cell aggregrate claim 15. 

A non-human animal comprising a reporter gene construct of claim 13, a cell of 
claim 14, a cell aggregate of claim 15, a tissue of claim 16 or an organ of claim 17. 

A composition of matter comprising a reporter gene of claim 13, tissue of claim 
16, cells of claim 14 or a cell aggregate of claim 15. 

An array comprising a solid support and attached thereto or suspended thereon 
cells of claim 14, a cell aggregate of claim 15 or tissue of claim 16. 

An apparatus for analyzing the array of 21 . 

A method of obtaining and/or profiling a modulator of cell differentiation 
comprising: 

(a) contacting a test sample comprising a cell of claim 14, cell aggregate of 
claim 15, tissue of claim 16 or organ of claim 17 or a non-human animal of 
claim 19 with a test substance; and 

(b) determining the effect of the test substance on the amount of the reporter 
gene product or activity compared to a control sample or animal. 

The method of claim 23, wherein said contacting step further includes contacting 
said test sample or animal with at least one second test substance in the presence of 
said first test substance. 

The method of any one of claims 1 to 12 or 23 to 24, wherein a compound known 
to activate or inhibit differentiation process is added to the culture medium or 
animal. 

The method of any one of claims 23 to 25, wherein the test substance is a 
therapeutic agent. 



The method of any one of claims 23 to 26, wherein the test substance is a mixture 
of therapeutic agents. 

The method of any one of claims 23 to 27, wherein preferably in a first screen said 
test substance is comprised in and subjected as a collection of test substances. 

The method of claim 28, wherein said collection of test substances has a diversity 
of about 10 3 to about 10 5 . 

The method of claim 29, wherein the diversity of said collection of test substances 
is successively reduced. 

The method of any one of claims 23 to 30 which is performed on an array. 

The method of any one of claims 1 to 12 or 23 to 31, wherein said one or more 
cells are genetically engineered to (over) express or inhibit the expression of a 
target gene. 

The method of any one of claims 1 to 12 or 23 to 32, wherein said one or more 
cells or tissue are contained in a container. 

The method of any one of claims 1 to 13 or 25 to 34, comprising taking 3 or more 
measurements, optionally at different positions within the container. 

The method of any one of claims 33 to 34, wherein said container is a well in a 
microtiter plate. 

The method of claim 35, wherein said microtiter plate is a 24, 96, 384 or 1586 well 
plate. 

A method of obtaining and manufacturing a drug which promotes or inhibits 
formation of specific cell types comprising the steps of any one of claims 23 to 36, 
wherein an enhanced or reduced level or activity of the reporter gene product is 
indicative for the drug. 
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A method of manufacturing an agent which supports wound healing and/or healing 
of damaged tissue comprising the steps of any one of claims 23 to 37, wherein an 
enhanced level or activity of the reporter gene product is indicative for said agent. 

A method of determining toxicity, preferably teratogenicity, embryotoxicity, 
chronic or acute toxicity of a compound comprising the steps of the method of any 
one of claims 23 to 37, wherein a reduced or enhanced level or activity of said 
reporter gene product is indicative for the toxicity of the compound. 

The method of any one of claims 23 to 39 further comprising modifying said 
substance to alter, eliminate and/or derivatize a portion thereof suspected causing 
toxicity, increasing bioavailability, solubility and/or half-life. 

The method of any one of claims 23 to 40 further comprising mixing the substance 
isolated or modified with a pharmaceutical^ acceptable carrier. 

A kit useful for conducting a method of any one of claims 1 to 12 or 23 to 41, 
containing for example a reporter gene construct of claim 13, a cell of claim 14, 
and standard compounds, like cell culture media, selection agents, detection agents 
for the reporter molecule and control samples. 

A method of conducting a drug discovery business comprising: 

(a) providing one or more assay systems of any one of claims 1 to 12 or 23 to 41 
for identifying a modulator of cell differentiation; and/or 

(b) conducting therapeutic profiling of modulators identified in step (a), or 
further analogs thereof, for efficacy and toxicity in animals of claim 19; and 

(c) formulating a pharmaceutical preparation including one or more modulators 
identified in step (b) as having an acceptable therapeutic profile. 

A method of conducting a target discovery business comprising: 
(a) providing one or more assay systems of any one of claims 1 to 12 or 23 to 41 
for identifying modulators of cell differentiation; 
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(b) (optionally) conducting therapeutic profiling of modulators identified in step 
(a) for efficacy and toxicity in animals of claim 19; and 

(c) licensing, to a third party, the rights for further drug development and/or 
sales for modulators identified in step (a), or analogs thereof. 

A modulator of cell differentiation such as growth and tissue-formation promoting 
identified according to the method of any one of claims 23 to 41. 

A pharmaceutical composition for use in the modulation of cell differentiation 
comprising a modulator identified according to the method of any one of claims 23 
to 41. 

A method of making a pharmaceutical composition for use in modulating cell 
differentiation comprising mixing a modulator of cell differentiation identified 
according to a method of any one of claims 23 to 41 with a suitable diluent or 
carrier. 

Use of a reporter gene construct of claim 13, a cell of claim 14, a cell aggregate of 
claim 15, tissue of claim 16, organ of claim 17, the implant or transplant of claim 
18, a non-human animal of claim 19, the composition of claim 20, an array of 
claim 21 or the apparatus of claim 22 in drug discovery or pharmacokinetic or 
pharmacological profiling. 

A vector comprising the promoter region of the mouse alpha myosin heavy chain 
gene or of the ventricular myosin regulatory light chain gene, and operably linked 
thereto a heterologous DNA sequence. 

The vector of claim 49, wherein said promoter comprises the nucleotide sequence 
of SEQ ID NO: 1 or 2, or a fragment thereof. 

The vector of claim 49 or 50, wherein said heterologous DNA sequence encodes a 
reporter or a selectable marker. 
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The vector of any one of claims 49 to 51, wherein said DNA sequence encodes 
secreted alkaline phosphatase protein (SEAP). 

The vector of any one of claims 49 to 52 comprising the nucleotide sequence of 
SEQIDNO: 3. 

Use of a promoter region of the mouse alpha myosin heavy chain gene or of the 
ventricular myosin regulatory light chain gene for the specific expression of 
heterologous DNA sequences during embryogenesis or cell development. 
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i 

ABSTRACT 

Provided are means and methods for in vitro and in vivo detection of chemically induced 
effects on embryonic development and differentiation for the purpose of 
5 embryotoxicity/teratogenicity screening as well as for the identification of pharmaceuticals 
such as growth and tissue promoting factors based on differentiating pluripotent embryonic 
stem (ES) cells. The assays are based on the use of transgenic ES cells and non-human 
animals comprising such ES cells or derivatives thereof, wherein said ES cells are 
characterized by the differentiation-dependent expression of a secreted reporter protein; in 
10 particular secreted embryonic alkaline phosphatase (SEAP). 
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